Dataset statistics
| Number of variables | 49 |
|---|---|
| Number of observations | 158957 |
| Missing cells | 1251928 |
| Missing cells (%) | 16.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 59.4 MiB |
| Average record size in memory | 392.0 B |
Variable types
| Numeric | 25 |
|---|---|
| Categorical | 23 |
| Unsupported | 1 |
CITY has constant value "WASHINGTON" | Constant |
STATE has constant value "DC" | Constant |
SALEDATE has a high cardinality: 6937 distinct values | High cardinality |
FULLADDRESS has a high cardinality: 105978 distinct values | High cardinality |
NATIONALGRID has a high cardinality: 105949 distinct values | High cardinality |
ASSESSMENT_NBHD has a high cardinality: 57 distinct values | High cardinality |
ASSESSMENT_SUBNBHD has a high cardinality: 121 distinct values | High cardinality |
CENSUS_BLOCK has a high cardinality: 3848 distinct values | High cardinality |
Unnamed: 0 is highly correlated with ROOMS and 1 other fields | High correlation |
BATHRM is highly correlated with ROOMS and 3 other fields | High correlation |
NUM_UNITS is highly correlated with ROOMS and 2 other fields | High correlation |
ROOMS is highly correlated with Unnamed: 0 and 6 other fields | High correlation |
BEDRM is highly correlated with Unnamed: 0 and 4 other fields | High correlation |
AYB is highly correlated with EYB | High correlation |
YR_RMDL is highly correlated with CMPLX_NUM | High correlation |
EYB is highly correlated with AYB | High correlation |
PRICE is highly correlated with GBA | High correlation |
GBA is highly correlated with BATHRM and 4 other fields | High correlation |
KITCHENS is highly correlated with NUM_UNITS and 2 other fields | High correlation |
FIREPLACES is highly correlated with GBA | High correlation |
USECODE is highly correlated with NUM_UNITS and 1 other fields | High correlation |
CMPLX_NUM is highly correlated with YR_RMDL | High correlation |
LIVING_GBA is highly correlated with BATHRM and 2 other fields | High correlation |
LATITUDE is highly correlated with CENSUS_TRACT and 1 other fields | High correlation |
LONGITUDE is highly correlated with CENSUS_TRACT and 1 other fields | High correlation |
CENSUS_TRACT is highly correlated with LATITUDE and 3 other fields | High correlation |
X is highly correlated with LONGITUDE and 1 other fields | High correlation |
Y is highly correlated with LATITUDE and 1 other fields | High correlation |
Unnamed: 0 is highly correlated with ROOMS and 3 other fields | High correlation |
BATHRM is highly correlated with ROOMS and 3 other fields | High correlation |
NUM_UNITS is highly correlated with KITCHENS and 1 other fields | High correlation |
ROOMS is highly correlated with Unnamed: 0 and 5 other fields | High correlation |
BEDRM is highly correlated with Unnamed: 0 and 5 other fields | High correlation |
PRICE is highly correlated with LIVING_GBA | High correlation |
GBA is highly correlated with BATHRM and 2 other fields | High correlation |
KITCHENS is highly correlated with NUM_UNITS | High correlation |
USECODE is highly correlated with Unnamed: 0 and 1 other fields | High correlation |
LANDAREA is highly correlated with Unnamed: 0 and 2 other fields | High correlation |
LIVING_GBA is highly correlated with BATHRM and 3 other fields | High correlation |
LATITUDE is highly correlated with LONGITUDE and 3 other fields | High correlation |
LONGITUDE is highly correlated with LATITUDE and 3 other fields | High correlation |
CENSUS_TRACT is highly correlated with LATITUDE and 3 other fields | High correlation |
X is highly correlated with LATITUDE and 3 other fields | High correlation |
Y is highly correlated with LATITUDE and 3 other fields | High correlation |
Unnamed: 0 is highly correlated with ROOMS | High correlation |
BATHRM is highly correlated with ROOMS and 3 other fields | High correlation |
NUM_UNITS is highly correlated with KITCHENS | High correlation |
ROOMS is highly correlated with Unnamed: 0 and 5 other fields | High correlation |
BEDRM is highly correlated with BATHRM and 4 other fields | High correlation |
GBA is highly correlated with BATHRM and 2 other fields | High correlation |
KITCHENS is highly correlated with NUM_UNITS | High correlation |
LANDAREA is highly correlated with ROOMS and 1 other fields | High correlation |
LIVING_GBA is highly correlated with BATHRM and 2 other fields | High correlation |
LATITUDE is highly correlated with Y | High correlation |
LONGITUDE is highly correlated with CENSUS_TRACT and 1 other fields | High correlation |
CENSUS_TRACT is highly correlated with LONGITUDE and 1 other fields | High correlation |
X is highly correlated with LONGITUDE and 1 other fields | High correlation |
Y is highly correlated with LATITUDE | High correlation |
ROOMS is highly correlated with BATHRM and 9 other fields | High correlation |
AC is highly correlated with EXTWALL and 5 other fields | High correlation |
QUADRANT is highly correlated with Y and 7 other fields | High correlation |
EXTWALL is highly correlated with AC and 4 other fields | High correlation |
FIREPLACES is highly correlated with YR_RMDL | High correlation |
STRUCT is highly correlated with AC and 8 other fields | High correlation |
BATHRM is highly correlated with ROOMS and 4 other fields | High correlation |
ROOF is highly correlated with STRUCT and 3 other fields | High correlation |
Y is highly correlated with QUADRANT and 8 other fields | High correlation |
LANDAREA is highly correlated with ROOMS and 3 other fields | High correlation |
CMPLX_NUM is highly correlated with Y and 5 other fields | High correlation |
AYB is highly correlated with EYB and 3 other fields | High correlation |
SOURCE is highly correlated with ROOMS and 10 other fields | High correlation |
STYLE is highly correlated with STRUCT and 2 other fields | High correlation |
HF_BATHRM is highly correlated with ROOMS and 1 other fields | High correlation |
WARD is highly correlated with QUADRANT and 12 other fields | High correlation |
LATITUDE is highly correlated with QUADRANT and 8 other fields | High correlation |
CNDTN is highly correlated with AC and 4 other fields | High correlation |
KITCHENS is highly correlated with NUM_UNITS | High correlation |
PRICE is highly correlated with GBA | High correlation |
USECODE is highly correlated with STRUCT and 1 other fields | High correlation |
EYB is highly correlated with AC and 6 other fields | High correlation |
HEAT is highly correlated with AC and 4 other fields | High correlation |
LIVING_GBA is highly correlated with ROOMS and 2 other fields | High correlation |
YR_RMDL is highly correlated with FIREPLACES | High correlation |
BEDRM is highly correlated with ROOMS and 5 other fields | High correlation |
GBA is highly correlated with ROOMS and 5 other fields | High correlation |
CENSUS_TRACT is highly correlated with QUADRANT and 12 other fields | High correlation |
GRADE is highly correlated with AC and 11 other fields | High correlation |
X is highly correlated with QUADRANT and 10 other fields | High correlation |
Unnamed: 0 is highly correlated with ROOMS and 14 other fields | High correlation |
ASSESSMENT_NBHD is highly correlated with ROOMS and 21 other fields | High correlation |
NUM_UNITS is highly correlated with STRUCT and 2 other fields | High correlation |
GIS_LAST_MOD_DTTM is highly correlated with ROOMS and 10 other fields | High correlation |
LONGITUDE is highly correlated with QUADRANT and 10 other fields | High correlation |
ROOF is highly correlated with CITY and 3 other fields | High correlation |
AC is highly correlated with HEAT and 2 other fields | High correlation |
QUADRANT is highly correlated with CITY and 3 other fields | High correlation |
HEAT is highly correlated with AC and 4 other fields | High correlation |
CITY is highly correlated with ROOF and 16 other fields | High correlation |
INTWALL is highly correlated with CITY and 3 other fields | High correlation |
ASSESSMENT_NBHD is highly correlated with QUADRANT and 5 other fields | High correlation |
STATE is highly correlated with ROOF and 16 other fields | High correlation |
BLDG_NUM is highly correlated with CITY and 1 other fields | High correlation |
QUALIFIED is highly correlated with CITY and 1 other fields | High correlation |
GIS_LAST_MOD_DTTM is highly correlated with ROOF and 11 other fields | High correlation |
STYLE is highly correlated with CITY and 3 other fields | High correlation |
EXTWALL is highly correlated with CITY and 3 other fields | High correlation |
STRUCT is highly correlated with CITY and 3 other fields | High correlation |
SOURCE is highly correlated with ROOF and 11 other fields | High correlation |
WARD is highly correlated with QUADRANT and 3 other fields | High correlation |
GRADE is highly correlated with CITY and 3 other fields | High correlation |
CNDTN is highly correlated with CITY and 3 other fields | High correlation |
NUM_UNITS has 52261 (32.9%) missing values | Missing |
YR_RMDL has 78029 (49.1%) missing values | Missing |
STORIES has 52305 (32.9%) missing values | Missing |
SALEDATE has 26770 (16.8%) missing values | Missing |
PRICE has 60741 (38.2%) missing values | Missing |
GBA has 52261 (32.9%) missing values | Missing |
STYLE has 52261 (32.9%) missing values | Missing |
STRUCT has 52261 (32.9%) missing values | Missing |
GRADE has 52261 (32.9%) missing values | Missing |
CNDTN has 52261 (32.9%) missing values | Missing |
EXTWALL has 52261 (32.9%) missing values | Missing |
ROOF has 52261 (32.9%) missing values | Missing |
INTWALL has 52261 (32.9%) missing values | Missing |
KITCHENS has 52262 (32.9%) missing values | Missing |
CMPLX_NUM has 106696 (67.1%) missing values | Missing |
LIVING_GBA has 106696 (67.1%) missing values | Missing |
FULLADDRESS has 52917 (33.3%) missing values | Missing |
CITY has 52906 (33.3%) missing values | Missing |
STATE has 52906 (33.3%) missing values | Missing |
NATIONALGRID has 52906 (33.3%) missing values | Missing |
ASSESSMENT_SUBNBHD has 32551 (20.5%) missing values | Missing |
CENSUS_BLOCK has 52906 (33.3%) missing values | Missing |
YR_RMDL is highly skewed (γ1 = -21.69324411) | Skewed |
STORIES is highly skewed (γ1 = 228.6851767) | Skewed |
FIREPLACES is highly skewed (γ1 = 398.5490354) | Skewed |
LANDAREA is highly skewed (γ1 = 78.59012056) | Skewed |
Unnamed: 0 is uniformly distributed | Uniform |
FULLADDRESS is uniformly distributed | Uniform |
NATIONALGRID is uniformly distributed | Uniform |
Unnamed: 0 has unique values | Unique |
SQUARE is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
HF_BATHRM has 93148 (58.6%) zeros | Zeros |
BEDRM has 5297 (3.3%) zeros | Zeros |
FIREPLACES has 103837 (65.3%) zeros | Zeros |
Reproduction
| Analysis started | 2021-07-08 05:46:53.159994 |
|---|---|
| Analysis finished | 2021-07-08 05:51:09.919361 |
| Duration | 4 minutes and 16.76 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
Unnamed: 0
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONUNIFORMUNIQUE| Distinct | 158957 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 79478 |
| Minimum | 0 |
|---|---|
| Maximum | 158956 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 7947.8 |
| Q1 | 39739 |
| median | 79478 |
| Q3 | 119217 |
| 95-th percentile | 151008.2 |
| Maximum | 158956 |
| Range | 158956 |
| Interquartile range (IQR) | 79478 |
Descriptive statistics
| Standard deviation | 45887.07771 |
|---|---|
| Coefficient of variation (CV) | 0.5773557174 |
| Kurtosis | -1.2 |
| Mean | 79478 |
| Median Absolute Deviation (MAD) | 39739 |
| Skewness | 0 |
| Sum | 1.263358445 × 1010 |
| Variance | 2105623900 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 2047 | 1 | < 0.1% |
| 7465 | 1 | < 0.1% |
| 54576 | 1 | < 0.1% |
| 11567 | 1 | < 0.1% |
| 9518 | 1 | < 0.1% |
| 15661 | 1 | < 0.1% |
| 13612 | 1 | < 0.1% |
| 3371 | 1 | < 0.1% |
| 1322 | 1 | < 0.1% |
| 5416 | 1 | < 0.1% |
| Other values (158947) | 158947 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 158956 | 1 | |
| 158955 | 1 | |
| 158954 | 1 | |
| 158953 | 1 | |
| 158952 | 1 | |
| 158951 | 1 | |
| 158950 | 1 | |
| 158949 | 1 | |
| 158948 | 1 | |
| 158947 | 1 |
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.81067836 |
| Minimum | 0 |
|---|---|
| Maximum | 14 |
| Zeros | 58 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 4 |
| Maximum | 14 |
| Range | 14 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.9763959589 |
|---|---|
| Coefficient of variation (CV) | 0.5392431813 |
| Kurtosis | 3.893857213 |
| Mean | 1.81067836 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.514664494 |
| Sum | 287820 |
| Variance | 0.9533490686 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 74555 | |
| 2 | 53325 | |
| 3 | 20785 | 13.1% |
| 4 | 8119 | 5.1% |
| 5 | 1367 | 0.9% |
| 6 | 500 | 0.3% |
| 7 | 129 | 0.1% |
| 8 | 71 | < 0.1% |
| 0 | 58 | < 0.1% |
| 9 | 22 | < 0.1% |
| Other values (5) | 26 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 58 | < 0.1% |
| 1 | 74555 | |
| 2 | 53325 | |
| 3 | 20785 | 13.1% |
| 4 | 8119 | 5.1% |
| 5 | 1367 | 0.9% |
| 6 | 500 | 0.3% |
| 7 | 129 | 0.1% |
| 8 | 71 | < 0.1% |
| 9 | 22 | < 0.1% |
| Value | Count | Frequency (%) |
| 14 | 1 | < 0.1% |
| 13 | 1 | < 0.1% |
| 12 | 3 | < 0.1% |
| 11 | 7 | < 0.1% |
| 10 | 14 | < 0.1% |
| 9 | 22 | < 0.1% |
| 8 | 71 | < 0.1% |
| 7 | 129 | 0.1% |
| 6 | 500 | 0.3% |
| 5 | 1367 |
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4582371333 |
| Minimum | 0 |
|---|---|
| Maximum | 11 |
| Zeros | 93148 |
| Zeros (%) | 58.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 11 |
| Range | 11 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.5875714745 |
|---|---|
| Coefficient of variation (CV) | 1.282243257 |
| Kurtosis | 2.074616926 |
| Mean | 0.4582371333 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.074096595 |
| Sum | 72840 |
| Variance | 0.3452402376 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 93148 | |
| 1 | 59258 | |
| 2 | 6186 | 3.9% |
| 3 | 289 | 0.2% |
| 4 | 56 | < 0.1% |
| 5 | 12 | < 0.1% |
| 7 | 3 | < 0.1% |
| 6 | 3 | < 0.1% |
| 11 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 93148 | |
| 1 | 59258 | |
| 2 | 6186 | 3.9% |
| 3 | 289 | 0.2% |
| 4 | 56 | < 0.1% |
| 5 | 12 | < 0.1% |
| 6 | 3 | < 0.1% |
| 7 | 3 | < 0.1% |
| 9 | 1 | < 0.1% |
| 11 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 11 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 7 | 3 | < 0.1% |
| 6 | 3 | < 0.1% |
| 5 | 12 | < 0.1% |
| 4 | 56 | < 0.1% |
| 3 | 289 | 0.2% |
| 2 | 6186 | 3.9% |
| 1 | 59258 | |
| 0 | 93148 |
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.2 MiB |
| Forced Air | |
|---|---|
| Hot Water Rad | |
| Warm Cool | |
| Ht Pump | |
| Wall Furnace | 1120 |
| Other values (9) | 1623 |
Length
| Max length | 14 |
|---|---|
| Median length | 10 |
| Mean length | 10.30165391 |
| Min length | 7 |
Characters and Unicode
| Total characters | 1637520 |
|---|---|
| Distinct characters | 36 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Warm Cool |
|---|---|
| 2nd row | Warm Cool |
| 3rd row | Hot Water Rad |
| 4th row | Hot Water Rad |
| 5th row | Warm Cool |
Common Values
| Value | Count | Frequency (%) |
| Forced Air | 53972 | |
| Hot Water Rad | 47202 | |
| Warm Cool | 33628 | |
| Ht Pump | 21412 | 13.5% |
| Wall Furnace | 1120 | 0.7% |
| Water Base Brd | 402 | 0.3% |
| Elec Base Brd | 351 | 0.2% |
| No Data | 330 | 0.2% |
| Electric Rad | 144 | 0.1% |
| Gravity Furnac | 140 | 0.1% |
| Other values (4) | 256 | 0.2% |
Length
| Value | Count | Frequency (%) |
| air | 54011 | |
| forced | 53972 | |
| water | 47604 | |
| rad | 47346 | |
| hot | 47202 | |
| cool | 33678 | |
| warm | 33628 | |
| pump | 21412 | 5.9% |
| ht | 21412 | 5.9% |
| furnace | 1120 | 0.3% |
| Other values (14) | 4367 | 1.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 206795 | ||
| r | 191629 | |
| o | 168860 | |
| a | 132511 | 8.1% |
| t | 116882 | 7.1% |
| e | 103944 | 6.3% |
| d | 102121 | 6.2% |
| W | 82352 | 5.0% |
| H | 68614 | 4.2% |
| c | 55910 | 3.4% |
| Other values (26) | 407902 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1064739 | |
| Uppercase Letter | 365869 | 22.3% |
| Space Separator | 206795 | 12.6% |
| Dash Punctuation | 117 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 191629 | |
| o | 168860 | |
| a | 132511 | |
| t | 116882 | |
| e | 103944 | |
| d | 102121 | |
| c | 55910 | 5.3% |
| m | 55040 | 5.2% |
| i | 54579 | 5.1% |
| l | 36530 | 3.4% |
| Other values (9) | 46733 | 4.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 82352 | |
| H | 68614 | |
| F | 55232 | |
| A | 54128 | |
| R | 47346 | |
| C | 33678 | |
| P | 21412 | 5.9% |
| B | 1506 | 0.4% |
| E | 584 | 0.2% |
| N | 330 | 0.1% |
| Other values (5) | 687 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 206795 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 117 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1430608 | |
| Common | 206912 | 12.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 191629 | |
| o | 168860 | |
| a | 132511 | 9.3% |
| t | 116882 | 8.2% |
| e | 103944 | 7.3% |
| d | 102121 | 7.1% |
| W | 82352 | 5.8% |
| H | 68614 | 4.8% |
| c | 55910 | 3.9% |
| F | 55232 | 3.9% |
| Other values (24) | 352553 |
Common
| Value | Count | Frequency (%) |
| 206795 | ||
| - | 117 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1637520 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 206795 | ||
| r | 191629 | |
| o | 168860 | |
| a | 132511 | 8.1% |
| t | 116882 | 7.1% |
| e | 103944 | 6.3% |
| d | 102121 | 6.2% |
| W | 82352 | 5.0% |
| H | 68614 | 4.2% |
| c | 55910 | 3.4% |
| Other values (26) | 407902 |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.2 MiB |
| Y | |
|---|---|
| N | |
| 0 | 65 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 158957 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Y |
|---|---|
| 2nd row | Y |
| 3rd row | Y |
| 4th row | Y |
| 5th row | Y |
Common Values
| Value | Count | Frequency (%) |
| Y | 114620 | |
| N | 44272 | 27.9% |
| 0 | 65 | < 0.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| y | 114620 | |
| n | 44272 | 27.9% |
| 0 | 65 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| Y | 114620 | |
| N | 44272 | 27.9% |
| 0 | 65 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 158892 | |
| Decimal Number | 65 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 114620 | |
| N | 44272 | 27.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 65 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 158892 | |
| Common | 65 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| Y | 114620 | |
| N | 44272 | 27.9% |
Common
| Value | Count | Frequency (%) |
| 0 | 65 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 158957 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| Y | 114620 | |
| N | 44272 | 27.9% |
| 0 | 65 | < 0.1% |
NUM_UNITS
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 52261 |
| Missing (%) | 32.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.198039289 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 168 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.5969244151 |
|---|---|
| Coefficient of variation (CV) | 0.4982511179 |
| Kurtosis | 12.38589718 |
| Mean | 1.198039289 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.467857332 |
| Sum | 127826 |
| Variance | 0.3563187574 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 92491 | |
| 2 | 9864 | 6.2% |
| 4 | 3059 | 1.9% |
| 3 | 1101 | 0.7% |
| 0 | 168 | 0.1% |
| 5 | 10 | < 0.1% |
| 6 | 3 | < 0.1% |
| (Missing) | 52261 |
| Value | Count | Frequency (%) |
| 0 | 168 | 0.1% |
| 1 | 92491 | |
| 2 | 9864 | 6.2% |
| 3 | 1101 | 0.7% |
| 4 | 3059 | 1.9% |
| 5 | 10 | < 0.1% |
| 6 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 6 | 3 | < 0.1% |
| 5 | 10 | < 0.1% |
| 4 | 3059 | 1.9% |
| 3 | 1101 | 0.7% |
| 2 | 9864 | 6.2% |
| 1 | 92491 | |
| 0 | 168 | 0.1% |
| Distinct | 40 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.187736306 |
| Minimum | 0 |
|---|---|
| Maximum | 48 |
| Zeros | 138 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 4 |
| median | 6 |
| Q3 | 7 |
| 95-th percentile | 11 |
| Maximum | 48 |
| Range | 48 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.618164876 |
|---|---|
| Coefficient of variation (CV) | 0.4231215984 |
| Kurtosis | 4.563615163 |
| Mean | 6.187736306 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.283358565 |
| Sum | 983584 |
| Variance | 6.854787319 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6 | 37259 | |
| 7 | 22338 | |
| 4 | 20593 | |
| 3 | 17759 | |
| 5 | 16852 | |
| 8 | 16327 | |
| 9 | 7616 | 4.8% |
| 10 | 5909 | 3.7% |
| 2 | 5294 | 3.3% |
| 12 | 2929 | 1.8% |
| Other values (30) | 6081 | 3.8% |
| Value | Count | Frequency (%) |
| 0 | 138 | 0.1% |
| 1 | 96 | 0.1% |
| 2 | 5294 | 3.3% |
| 3 | 17759 | |
| 4 | 20593 | |
| 5 | 16852 | |
| 6 | 37259 | |
| 7 | 22338 | |
| 8 | 16327 | |
| 9 | 7616 | 4.8% |
| Value | Count | Frequency (%) |
| 48 | 1 | |
| 41 | 1 | |
| 40 | 1 | |
| 39 | 2 | |
| 37 | 1 | |
| 35 | 1 | |
| 34 | 1 | |
| 32 | 1 | |
| 31 | 1 | |
| 30 | 1 |
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.732506275 |
| Minimum | 0 |
|---|---|
| Maximum | 24 |
| Zeros | 5297 |
| Zeros (%) | 3.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 24 |
| Range | 24 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.358864244 |
|---|---|
| Coefficient of variation (CV) | 0.497295928 |
| Kurtosis | 2.951014382 |
| Mean | 2.732506275 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.7307726738 |
| Sum | 434351 |
| Variance | 1.846512034 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 57864 | |
| 2 | 34946 | |
| 4 | 24893 | |
| 1 | 24181 | |
| 5 | 6898 | 4.3% |
| 0 | 5297 | 3.3% |
| 6 | 3090 | 1.9% |
| 8 | 792 | 0.5% |
| 7 | 750 | 0.5% |
| 9 | 123 | 0.1% |
| Other values (10) | 123 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 5297 | 3.3% |
| 1 | 24181 | |
| 2 | 34946 | |
| 3 | 57864 | |
| 4 | 24893 | |
| 5 | 6898 | 4.3% |
| 6 | 3090 | 1.9% |
| 7 | 750 | 0.5% |
| 8 | 792 | 0.5% |
| 9 | 123 | 0.1% |
| Value | Count | Frequency (%) |
| 24 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| 19 | 1 | < 0.1% |
| 16 | 2 | < 0.1% |
| 15 | 3 | < 0.1% |
| 14 | 2 | < 0.1% |
| 13 | 4 | < 0.1% |
| 12 | 34 | |
| 11 | 13 | < 0.1% |
| 10 | 62 |
| Distinct | 220 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 271 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1941.987579 |
| Minimum | 1754 |
|---|---|
| Maximum | 2019 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.2 MiB |
Quantile statistics
| Minimum | 1754 |
|---|---|
| 5-th percentile | 1900 |
| Q1 | 1918 |
| median | 1937 |
| Q3 | 1960 |
| 95-th percentile | 2007 |
| Maximum | 2019 |
| Range | 265 |
| Interquartile range (IQR) | 42 |
Descriptive statistics
| Standard deviation | 33.64023358 |
|---|---|
| Coefficient of variation (CV) | 0.01732257916 |
| Kurtosis | -0.07799373814 |
| Mean | 1941.987579 |
| Median Absolute Deviation (MAD) | 21 |
| Skewness | 0.5112530606 |
| Sum | 308166241 |
| Variance | 1131.665315 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1900 | 8967 | 5.6% |
| 1925 | 5129 | 3.2% |
| 1910 | 4563 | 2.9% |
| 1940 | 4316 | 2.7% |
| 1923 | 3724 | 2.3% |
| 1927 | 3707 | 2.3% |
| 1941 | 3420 | 2.2% |
| 1926 | 3117 | 2.0% |
| 1942 | 3058 | 1.9% |
| 1939 | 2849 | 1.8% |
| Other values (210) | 115836 |
| Value | Count | Frequency (%) |
| 1754 | 2 | |
| 1765 | 1 | < 0.1% |
| 1776 | 3 | |
| 1780 | 4 | |
| 1782 | 1 | < 0.1% |
| 1784 | 1 | < 0.1% |
| 1785 | 1 | < 0.1% |
| 1787 | 1 | < 0.1% |
| 1788 | 1 | < 0.1% |
| 1790 | 3 |
| Value | Count | Frequency (%) |
| 2019 | 1 | < 0.1% |
| 2018 | 98 | 0.1% |
| 2017 | 592 | |
| 2016 | 1016 | |
| 2015 | 906 | |
| 2014 | 683 | |
| 2013 | 794 | |
| 2012 | 438 | |
| 2011 | 389 | 0.2% |
| 2010 | 380 | 0.2% |
| Distinct | 110 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 78029 |
| Missing (%) | 49.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1998.243537 |
| Minimum | 20 |
|---|---|
| Maximum | 2019 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.2 MiB |
Quantile statistics
| Minimum | 20 |
|---|---|
| 5-th percentile | 1973 |
| Q1 | 1985 |
| median | 2004 |
| Q3 | 2010 |
| 95-th percentile | 2016 |
| Maximum | 2019 |
| Range | 1999 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 16.57578569 |
|---|---|
| Coefficient of variation (CV) | 0.008295177927 |
| Kurtosis | 2506.352909 |
| Mean | 1998.243537 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | -21.69324411 |
| Sum | 161713853 |
| Variance | 274.7566711 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2006 | 5029 | 3.2% |
| 2005 | 4937 | 3.1% |
| 2004 | 3985 | 2.5% |
| 2007 | 3771 | 2.4% |
| 1980 | 3310 | 2.1% |
| 2003 | 2951 | 1.9% |
| 2011 | 2856 | 1.8% |
| 2008 | 2766 | 1.7% |
| 1978 | 2690 | 1.7% |
| 2010 | 2680 | 1.7% |
| Other values (100) | 45953 | |
| (Missing) | 78029 |
| Value | Count | Frequency (%) |
| 20 | 1 | |
| 1880 | 2 | |
| 1900 | 2 | |
| 1910 | 1 | |
| 1911 | 1 | |
| 1912 | 1 | |
| 1913 | 1 | |
| 1915 | 1 | |
| 1916 | 2 | |
| 1917 | 2 |
| Value | Count | Frequency (%) |
| 2019 | 1 | < 0.1% |
| 2018 | 417 | 0.3% |
| 2017 | 1991 | |
| 2016 | 2190 | |
| 2015 | 2595 | |
| 2014 | 2645 | |
| 2013 | 2561 | |
| 2012 | 2648 | |
| 2011 | 2856 | |
| 2010 | 2680 |
| Distinct | 135 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1963.718024 |
| Minimum | 1800 |
|---|---|
| Maximum | 2018 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.2 MiB |
Quantile statistics
| Minimum | 1800 |
|---|---|
| 5-th percentile | 1919 |
| Q1 | 1954 |
| median | 1963 |
| Q3 | 1975 |
| 95-th percentile | 2009 |
| Maximum | 2018 |
| Range | 218 |
| Interquartile range (IQR) | 21 |
Descriptive statistics
| Standard deviation | 24.92315012 |
|---|---|
| Coefficient of variation (CV) | 0.01269181716 |
| Kurtosis | 0.6211768164 |
| Mean | 1963.718024 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | -0.1224757013 |
| Sum | 312146726 |
| Variance | 621.1634118 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1957 | 12541 | 7.9% |
| 1954 | 12346 | 7.8% |
| 1967 | 10408 | 6.5% |
| 1964 | 9362 | 5.9% |
| 1960 | 7636 | 4.8% |
| 1969 | 7033 | 4.4% |
| 1943 | 5022 | 3.2% |
| 1919 | 4707 | 3.0% |
| 1950 | 4522 | 2.8% |
| 1947 | 3718 | 2.3% |
| Other values (125) | 81662 |
| Value | Count | Frequency (%) |
| 1800 | 4 | < 0.1% |
| 1820 | 6 | < 0.1% |
| 1865 | 4 | < 0.1% |
| 1870 | 10 | < 0.1% |
| 1875 | 100 | |
| 1876 | 6 | < 0.1% |
| 1880 | 55 | < 0.1% |
| 1885 | 153 | |
| 1886 | 4 | < 0.1% |
| 1887 | 23 | < 0.1% |
| Value | Count | Frequency (%) |
| 2018 | 186 | 0.1% |
| 2017 | 886 | |
| 2016 | 1032 | |
| 2015 | 1360 | |
| 2014 | 716 | |
| 2013 | 886 | |
| 2012 | 507 | 0.3% |
| 2011 | 726 | |
| 2010 | 963 | |
| 2009 | 707 |
| Distinct | 40 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 52305 |
| Missing (%) | 32.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.091793122 |
| Minimum | 0 |
|---|---|
| Maximum | 826 |
| Zeros | 43 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1.5 |
| Q1 | 2 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 826 |
| Range | 826 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.933322657 |
|---|---|
| Coefficient of variation (CV) | 1.402300556 |
| Kurtosis | 60245.74008 |
| Mean | 2.091793122 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 228.6851767 |
| Sum | 223093.92 |
| Variance | 8.604381812 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 79357 | |
| 3 | 9230 | 5.8% |
| 2.5 | 6105 | 3.8% |
| 1 | 4683 | 2.9% |
| 1.5 | 2291 | 1.4% |
| 2.25 | 2225 | 1.4% |
| 1.75 | 1175 | 0.7% |
| 1.25 | 452 | 0.3% |
| 2.75 | 444 | 0.3% |
| 4 | 375 | 0.2% |
| Other values (30) | 315 | 0.2% |
| (Missing) | 52305 |
| Value | Count | Frequency (%) |
| 0 | 43 | < 0.1% |
| 0.25 | 1 | < 0.1% |
| 0.5 | 1 | < 0.1% |
| 0.75 | 1 | < 0.1% |
| 1 | 4683 | |
| 1.25 | 452 | 0.3% |
| 1.34 | 1 | < 0.1% |
| 1.5 | 2291 | |
| 1.7 | 4 | < 0.1% |
| 1.75 | 1175 | 0.7% |
| Value | Count | Frequency (%) |
| 826 | 1 | < 0.1% |
| 275 | 2 | < 0.1% |
| 250 | 1 | < 0.1% |
| 65 | 1 | < 0.1% |
| 43 | 1 | < 0.1% |
| 25 | 4 | < 0.1% |
| 20 | 1 | < 0.1% |
| 12 | 1 | < 0.1% |
| 9 | 36 | |
| 8.25 | 1 | < 0.1% |
| Distinct | 6937 |
|---|---|
| Distinct (%) | 5.2% |
| Missing | 26770 |
| Missing (%) | 16.8% |
| Memory size | 1.2 MiB |
| 2007-04-10 00:00:00 | 413 |
|---|---|
| 1999-04-01 00:00:00 | 266 |
| 2001-01-01 00:00:00 | 258 |
| 2015-11-17 00:00:00 | 160 |
| 2010-05-04 00:00:00 | 134 |
| Other values (6932) |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Characters and Unicode
| Total characters | 2511553 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 539 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | 2003-11-25 00:00:00 |
|---|---|
| 2nd row | 2000-08-17 00:00:00 |
| 3rd row | 2016-06-21 00:00:00 |
| 4th row | 2006-07-12 00:00:00 |
| 5th row | 2010-02-26 00:00:00 |
Common Values
| Value | Count | Frequency (%) |
| 2007-04-10 00:00:00 | 413 | 0.3% |
| 1999-04-01 00:00:00 | 266 | 0.2% |
| 2001-01-01 00:00:00 | 258 | 0.2% |
| 2015-11-17 00:00:00 | 160 | 0.1% |
| 2010-05-04 00:00:00 | 134 | 0.1% |
| 2017-06-14 00:00:00 | 124 | 0.1% |
| 2018-05-29 00:00:00 | 104 | 0.1% |
| 2016-10-31 00:00:00 | 95 | 0.1% |
| 2018-07-03 00:00:00 | 88 | 0.1% |
| 2018-05-15 00:00:00 | 85 | 0.1% |
| Other values (6927) | 130460 | |
| (Missing) | 26770 | 16.8% |
Length
| Value | Count | Frequency (%) |
| 00:00:00 | 132187 | |
| 2007-04-10 | 413 | 0.2% |
| 1999-04-01 | 266 | 0.1% |
| 2001-01-01 | 258 | 0.1% |
| 2015-11-17 | 160 | 0.1% |
| 2010-05-04 | 134 | 0.1% |
| 2017-06-14 | 124 | < 0.1% |
| 2018-05-29 | 104 | < 0.1% |
| 2016-10-31 | 95 | < 0.1% |
| 2018-07-03 | 88 | < 0.1% |
| Other values (6928) | 130545 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1133288 | |
| - | 264374 | 10.5% |
| : | 264374 | 10.5% |
| 1 | 204851 | 8.2% |
| 2 | 204719 | 8.2% |
| 132187 | 5.3% | |
| 9 | 56502 | 2.2% |
| 7 | 45174 | 1.8% |
| 3 | 44372 | 1.8% |
| 6 | 42700 | 1.7% |
| Other values (3) | 119012 | 4.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1850618 | |
| Dash Punctuation | 264374 | 10.5% |
| Other Punctuation | 264374 | 10.5% |
| Space Separator | 132187 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1133288 | |
| 1 | 204851 | 11.1% |
| 2 | 204719 | 11.1% |
| 9 | 56502 | 3.1% |
| 7 | 45174 | 2.4% |
| 3 | 44372 | 2.4% |
| 6 | 42700 | 2.3% |
| 5 | 41316 | 2.2% |
| 4 | 39264 | 2.1% |
| 8 | 38432 | 2.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 264374 |
Space Separator
| Value | Count | Frequency (%) |
| 132187 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 264374 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2511553 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1133288 | |
| - | 264374 | 10.5% |
| : | 264374 | 10.5% |
| 1 | 204851 | 8.2% |
| 2 | 204719 | 8.2% |
| 132187 | 5.3% | |
| 9 | 56502 | 2.2% |
| 7 | 45174 | 1.8% |
| 3 | 44372 | 1.8% |
| 6 | 42700 | 1.7% |
| Other values (3) | 119012 | 4.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2511553 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1133288 | |
| - | 264374 | 10.5% |
| : | 264374 | 10.5% |
| 1 | 204851 | 8.2% |
| 2 | 204719 | 8.2% |
| 132187 | 5.3% | |
| 9 | 56502 | 2.2% |
| 7 | 45174 | 1.8% |
| 3 | 44372 | 1.8% |
| 6 | 42700 | 1.7% |
| Other values (3) | 119012 | 4.7% |
| Distinct | 13486 |
|---|---|
| Distinct (%) | 13.7% |
| Missing | 60741 |
| Missing (%) | 38.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 931351.5949 |
| Minimum | 1 |
|---|---|
| Maximum | 137427545 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 93086.75 |
| Q1 | 240000 |
| median | 399999 |
| Q3 | 652000 |
| 95-th percentile | 1350000 |
| Maximum | 137427545 |
| Range | 137427544 |
| Interquartile range (IQR) | 412000 |
Descriptive statistics
| Standard deviation | 7061324.956 |
|---|---|
| Coefficient of variation (CV) | 7.581803686 |
| Kurtosis | 344.9019408 |
| Mean | 931351.5949 |
| Median Absolute Deviation (MAD) | 192999 |
| Skewness | 18.3162491 |
| Sum | 9.147362825 × 1010 |
| Variance | 4.986231013 × 1013 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 350000 | 595 | 0.4% |
| 250000 | 536 | 0.3% |
| 300000 | 523 | 0.3% |
| 450000 | 519 | 0.3% |
| 375000 | 488 | 0.3% |
| 325000 | 482 | 0.3% |
| 550000 | 459 | 0.3% |
| 275000 | 455 | 0.3% |
| 500000 | 426 | 0.3% |
| 320000 | 425 | 0.3% |
| Other values (13476) | 93308 | |
| (Missing) | 60741 |
| Value | Count | Frequency (%) |
| 1 | 5 | < 0.1% |
| 10 | 4 | < 0.1% |
| 250 | 14 | |
| 500 | 4 | < 0.1% |
| 936 | 1 | < 0.1% |
| 1000 | 5 | < 0.1% |
| 1377 | 1 | < 0.1% |
| 2000 | 2 | < 0.1% |
| 3000 | 2 | < 0.1% |
| 3270 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 137427545 | 242 | |
| 53969391 | 118 | |
| 53696391 | 1 | < 0.1% |
| 25100000 | 1 | < 0.1% |
| 25000000 | 1 | < 0.1% |
| 23960287 | 1 | < 0.1% |
| 22000000 | 1 | < 0.1% |
| 18000000 | 1 | < 0.1% |
| 16100000 | 1 | < 0.1% |
| 15000000 | 1 | < 0.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.2 MiB |
| U | |
|---|---|
| Q |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 158957 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Q |
|---|---|
| 2nd row | U |
| 3rd row | Q |
| 4th row | Q |
| 5th row | U |
Common Values
| Value | Count | Frequency (%) |
| U | 82608 | |
| Q | 76349 |
Length
Pie chart
| Value | Count | Frequency (%) |
| u | 82608 | |
| q | 76349 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 82608 | |
| Q | 76349 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 158957 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 82608 | |
| Q | 76349 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 158957 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 82608 | |
| Q | 76349 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 158957 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 82608 | |
| Q | 76349 |
SALE_NUM
Real number (ℝ≥0)
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.680032965 |
| Minimum | 1 |
|---|---|
| Maximum | 15 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 4 |
| Maximum | 15 |
| Range | 14 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.285898145 |
|---|---|
| Coefficient of variation (CV) | 0.7654005437 |
| Kurtosis | 4.895860251 |
| Mean | 1.680032965 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.131739779 |
| Sum | 267053 |
| Variance | 1.653534039 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 113671 | |
| 3 | 14738 | 9.3% |
| 2 | 12901 | 8.1% |
| 4 | 9851 | 6.2% |
| 5 | 4687 | 2.9% |
| 6 | 1970 | 1.2% |
| 7 | 703 | 0.4% |
| 8 | 261 | 0.2% |
| 9 | 108 | 0.1% |
| 10 | 37 | < 0.1% |
| Other values (5) | 30 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 113671 | |
| 2 | 12901 | 8.1% |
| 3 | 14738 | 9.3% |
| 4 | 9851 | 6.2% |
| 5 | 4687 | 2.9% |
| 6 | 1970 | 1.2% |
| 7 | 703 | 0.4% |
| 8 | 261 | 0.2% |
| 9 | 108 | 0.1% |
| 10 | 37 | < 0.1% |
| Value | Count | Frequency (%) |
| 15 | 2 | < 0.1% |
| 14 | 2 | < 0.1% |
| 13 | 3 | < 0.1% |
| 12 | 6 | < 0.1% |
| 11 | 17 | < 0.1% |
| 10 | 37 | < 0.1% |
| 9 | 108 | 0.1% |
| 8 | 261 | 0.2% |
| 7 | 703 | 0.4% |
| 6 | 1970 |
| Distinct | 4764 |
|---|---|
| Distinct (%) | 4.5% |
| Missing | 52261 |
| Missing (%) | 32.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1714.539889 |
| Minimum | 0 |
|---|---|
| Maximum | 45384 |
| Zeros | 15 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 864 |
| Q1 | 1190 |
| median | 1480 |
| Q3 | 1966 |
| 95-th percentile | 3262 |
| Maximum | 45384 |
| Range | 45384 |
| Interquartile range (IQR) | 776 |
Descriptive statistics
| Standard deviation | 880.6778604 |
|---|---|
| Coefficient of variation (CV) | 0.5136525933 |
| Kurtosis | 135.3073343 |
| Mean | 1714.539889 |
| Median Absolute Deviation (MAD) | 342 |
| Skewness | 5.635667353 |
| Sum | 182934548 |
| Variance | 775593.4937 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1088 | 1782 | 1.1% |
| 1152 | 1661 | 1.0% |
| 1024 | 1405 | 0.9% |
| 832 | 1364 | 0.9% |
| 1280 | 1236 | 0.8% |
| 1080 | 1094 | 0.7% |
| 1200 | 877 | 0.6% |
| 1360 | 862 | 0.5% |
| 1440 | 815 | 0.5% |
| 800 | 723 | 0.5% |
| Other values (4754) | 94877 | |
| (Missing) | 52261 |
| Value | Count | Frequency (%) |
| 0 | 15 | |
| 180 | 1 | < 0.1% |
| 252 | 1 | < 0.1% |
| 299 | 1 | < 0.1% |
| 340 | 1 | < 0.1% |
| 360 | 1 | < 0.1% |
| 371 | 2 | < 0.1% |
| 380 | 1 | < 0.1% |
| 392 | 1 | < 0.1% |
| 396 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 45384 | 1 | |
| 41604 | 1 | |
| 27451 | 1 | |
| 24030 | 1 | |
| 21210 | 1 | |
| 20948 | 1 | |
| 20120 | 1 | |
| 20015 | 1 | |
| 18784 | 1 | |
| 18588 | 1 |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.2 MiB |
| 1 | |
|---|---|
| 2 | 59 |
| 3 | 8 |
| 4 | 4 |
| 5 | 2 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 158957 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 158884 | |
| 2 | 59 | < 0.1% |
| 3 | 8 | < 0.1% |
| 4 | 4 | < 0.1% |
| 5 | 2 | < 0.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 1 | 158884 | |
| 2 | 59 | < 0.1% |
| 3 | 8 | < 0.1% |
| 4 | 4 | < 0.1% |
| 5 | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 158884 | |
| 2 | 59 | < 0.1% |
| 3 | 8 | < 0.1% |
| 4 | 4 | < 0.1% |
| 5 | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 158957 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 158884 | |
| 2 | 59 | < 0.1% |
| 3 | 8 | < 0.1% |
| 4 | 4 | < 0.1% |
| 5 | 2 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 158957 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 158884 | |
| 2 | 59 | < 0.1% |
| 3 | 8 | < 0.1% |
| 4 | 4 | < 0.1% |
| 5 | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 158957 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 158884 | |
| 2 | 59 | < 0.1% |
| 3 | 8 | < 0.1% |
| 4 | 4 | < 0.1% |
| 5 | 2 | < 0.1% |
| Distinct | 18 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 52261 |
| Missing (%) | 32.9% |
| Memory size | 1.2 MiB |
| 2 Story | |
|---|---|
| 3 Story | |
| 2.5 Story Fin | 7000 |
| 1 Story | 4420 |
| 1.5 Story Fin | 2655 |
| Other values (13) | 2035 |
Length
| Max length | 15 |
|---|---|
| Median length | 7 |
| Mean length | 7.636987328 |
| Min length | 6 |
Characters and Unicode
| Total characters | 814836 |
|---|---|
| Distinct characters | 34 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 3 Story |
|---|---|
| 2nd row | 3 Story |
| 3rd row | 3 Story |
| 4th row | 3 Story |
| 5th row | 3 Story |
Common Values
| Value | Count | Frequency (%) |
| 2 Story | 81137 | |
| 3 Story | 9449 | 5.9% |
| 2.5 Story Fin | 7000 | 4.4% |
| 1 Story | 4420 | 2.8% |
| 1.5 Story Fin | 2655 | 1.7% |
| 2.5 Story Unfin | 729 | 0.5% |
| 4 Story | 369 | 0.2% |
| Split Level | 303 | 0.2% |
| Split Foyer | 279 | 0.2% |
| 3.5 Story Fin | 133 | 0.1% |
| Other values (8) | 222 | 0.1% |
| (Missing) | 52261 |
Length
| Value | Count | Frequency (%) |
| story | 106027 | |
| 2 | 81137 | |
| fin | 9801 | 4.4% |
| 3 | 9449 | 4.2% |
| 2.5 | 7729 | 3.5% |
| 1 | 4420 | 2.0% |
| 1.5 | 2767 | 1.2% |
| unfin | 851 | 0.4% |
| split | 582 | 0.3% |
| 4 | 369 | 0.2% |
| Other values (8) | 825 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 117261 | ||
| t | 106677 | |
| S | 106609 | |
| o | 106306 | |
| r | 106306 | |
| y | 106306 | |
| 2 | 88866 | |
| n | 11506 | 1.4% |
| i | 11255 | 1.4% |
| . | 10652 | 1.3% |
| Other values (24) | 43092 | 5.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 452276 | |
| Uppercase Letter | 117949 | 14.5% |
| Space Separator | 117261 | 14.4% |
| Decimal Number | 116679 | 14.3% |
| Other Punctuation | 10652 | 1.3% |
| Dash Punctuation | 19 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 106677 | |
| o | 106306 | |
| r | 106306 | |
| y | 106306 | |
| n | 11506 | 2.5% |
| i | 11255 | 2.5% |
| e | 988 | 0.2% |
| l | 970 | 0.2% |
| f | 916 | 0.2% |
| p | 582 | 0.1% |
| Other values (8) | 464 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 106609 | |
| F | 10080 | 8.5% |
| U | 851 | 0.7% |
| L | 322 | 0.3% |
| D | 65 | 0.1% |
| B | 19 | < 0.1% |
| V | 2 | < 0.1% |
| O | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 88866 | |
| 5 | 10652 | 9.1% |
| 3 | 9590 | 8.2% |
| 1 | 7187 | 6.2% |
| 4 | 384 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 117261 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 10652 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 19 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 570225 | |
| Common | 244611 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 106677 | |
| S | 106609 | |
| o | 106306 | |
| r | 106306 | |
| y | 106306 | |
| n | 11506 | 2.0% |
| i | 11255 | 2.0% |
| F | 10080 | 1.8% |
| e | 988 | 0.2% |
| l | 970 | 0.2% |
| Other values (16) | 3222 | 0.6% |
Common
| Value | Count | Frequency (%) |
| 117261 | ||
| 2 | 88866 | |
| . | 10652 | 4.4% |
| 5 | 10652 | 4.4% |
| 3 | 9590 | 3.9% |
| 1 | 7187 | 2.9% |
| 4 | 384 | 0.2% |
| - | 19 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 814836 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 117261 | ||
| t | 106677 | |
| S | 106609 | |
| o | 106306 | |
| r | 106306 | |
| y | 106306 | |
| 2 | 88866 | |
| n | 11506 | 1.4% |
| i | 11255 | 1.4% |
| . | 10652 | 1.3% |
| Other values (24) | 43092 | 5.3% |
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 52261 |
| Missing (%) | 32.9% |
| Memory size | 1.2 MiB |
| Row Inside | |
|---|---|
| Single | |
| Semi-Detached | |
| Row End | |
| Multi | |
| Other values (4) | 333 |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 8.70365337 |
| Min length | 5 |
Characters and Unicode
| Total characters | 928645 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Row Inside |
|---|---|
| 2nd row | Row Inside |
| 3rd row | Row Inside |
| 4th row | Row Inside |
| 5th row | Semi-Detached |
Common Values
| Value | Count | Frequency (%) |
| Row Inside | 40593 | |
| Single | 32063 | |
| Semi-Detached | 16756 | 10.5% |
| Row End | 12225 | 7.7% |
| Multi | 4726 | 3.0% |
| Town Inside | 218 | 0.1% |
| Town End | 85 | 0.1% |
| Default | 26 | < 0.1% |
| Vacant Land | 4 | < 0.1% |
| (Missing) | 52261 |
Length
Pie chart
| Value | Count | Frequency (%) |
| row | 52818 | |
| inside | 40811 | |
| single | 32063 | |
| semi-detached | 16756 | 10.5% |
| end | 12310 | 7.7% |
| multi | 4726 | 3.0% |
| town | 303 | 0.2% |
| default | 26 | < 0.1% |
| land | 4 | < 0.1% |
| vacant | 4 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 123168 | |
| i | 94356 | 10.2% |
| n | 85495 | 9.2% |
| d | 69881 | 7.5% |
| 53125 | 5.7% | |
| o | 53121 | 5.7% |
| w | 53121 | 5.7% |
| R | 52818 | 5.7% |
| S | 48819 | 5.3% |
| I | 40811 | 4.4% |
| Other values (17) | 253930 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 682187 | |
| Uppercase Letter | 176577 | 19.0% |
| Space Separator | 53125 | 5.7% |
| Dash Punctuation | 16756 | 1.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 123168 | |
| i | 94356 | |
| n | 85495 | |
| d | 69881 | |
| o | 53121 | |
| w | 53121 | |
| s | 40811 | 6.0% |
| l | 36815 | 5.4% |
| g | 32063 | 4.7% |
| t | 21512 | 3.2% |
| Other values (6) | 71844 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 52818 | |
| S | 48819 | |
| I | 40811 | |
| D | 16782 | 9.5% |
| E | 12310 | 7.0% |
| M | 4726 | 2.7% |
| T | 303 | 0.2% |
| V | 4 | < 0.1% |
| L | 4 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 53125 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 16756 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 858764 | |
| Common | 69881 | 7.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 123168 | |
| i | 94356 | |
| n | 85495 | |
| d | 69881 | 8.1% |
| o | 53121 | 6.2% |
| w | 53121 | 6.2% |
| R | 52818 | 6.2% |
| S | 48819 | 5.7% |
| I | 40811 | 4.8% |
| s | 40811 | 4.8% |
| Other values (15) | 196363 |
Common
| Value | Count | Frequency (%) |
| 53125 | ||
| - | 16756 | 24.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 928645 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 123168 | |
| i | 94356 | 10.2% |
| n | 85495 | 9.2% |
| d | 69881 | 7.5% |
| 53125 | 5.7% | |
| o | 53121 | 5.7% |
| w | 53121 | 5.7% |
| R | 52818 | 5.7% |
| S | 48819 | 5.3% |
| I | 40811 | 4.4% |
| Other values (17) | 253930 |
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 52261 |
| Missing (%) | 32.9% |
| Memory size | 1.2 MiB |
| Average | |
|---|---|
| Above Average | |
| Good Quality | |
| Very Good | |
| Excellent | 3390 |
| Other values (8) |
Length
| Max length | 13 |
|---|---|
| Median length | 12 |
| Mean length | 10.11468096 |
| Min length | 7 |
Characters and Unicode
| Total characters | 1079196 |
|---|---|
| Distinct characters | 32 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Very Good |
|---|---|
| 2nd row | Very Good |
| 3rd row | Very Good |
| 4th row | Very Good |
| 5th row | Very Good |
Common Values
| Value | Count | Frequency (%) |
| Average | 37357 | |
| Above Average | 32101 | |
| Good Quality | 20800 | 13.1% |
| Very Good | 8976 | 5.6% |
| Excellent | 3390 | 2.1% |
| Superior | 2634 | 1.7% |
| Exceptional-A | 818 | 0.5% |
| Exceptional-B | 278 | 0.2% |
| Fair Quality | 150 | 0.1% |
| Exceptional-C | 92 | 0.1% |
| Other values (3) | 100 | 0.1% |
| (Missing) | 52261 |
Length
| Value | Count | Frequency (%) |
| average | 69458 | |
| above | 32101 | |
| good | 29776 | |
| quality | 20956 | 12.4% |
| very | 8976 | 5.3% |
| excellent | 3390 | 2.0% |
| superior | 2634 | 1.6% |
| exceptional-a | 818 | 0.5% |
| exceptional-b | 278 | 0.2% |
| fair | 150 | 0.1% |
| Other values (5) | 211 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 190670 | |
| A | 102377 | |
| v | 101559 | |
| o | 95575 | |
| a | 91865 | |
| r | 83852 | 7.8% |
| g | 69458 | 6.4% |
| 62052 | 5.7% | |
| b | 32101 | 3.0% |
| y | 29932 | 2.8% |
| Other values (22) | 219755 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 845870 | |
| Uppercase Letter | 170011 | 15.8% |
| Space Separator | 62052 | 5.7% |
| Dash Punctuation | 1263 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 190670 | |
| v | 101559 | |
| o | 95575 | |
| a | 91865 | |
| r | 83852 | |
| g | 69458 | 8.2% |
| b | 32101 | 3.8% |
| y | 29932 | 3.5% |
| d | 29776 | 3.5% |
| l | 28999 | 3.4% |
| Other values (8) | 92083 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 102377 | |
| G | 29776 | 17.5% |
| Q | 20956 | 12.3% |
| V | 8976 | 5.3% |
| E | 4653 | 2.7% |
| S | 2634 | 1.5% |
| B | 278 | 0.2% |
| F | 150 | 0.1% |
| D | 94 | 0.1% |
| C | 92 | 0.1% |
| Other values (2) | 25 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 62052 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1263 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1015881 | |
| Common | 63315 | 5.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 190670 | |
| A | 102377 | |
| v | 101559 | |
| o | 95575 | |
| a | 91865 | |
| r | 83852 | |
| g | 69458 | 6.8% |
| b | 32101 | 3.2% |
| y | 29932 | 2.9% |
| G | 29776 | 2.9% |
| Other values (20) | 188716 |
Common
| Value | Count | Frequency (%) |
| 62052 | ||
| - | 1263 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1079196 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 190670 | |
| A | 102377 | |
| v | 101559 | |
| o | 95575 | |
| a | 91865 | |
| r | 83852 | 7.8% |
| g | 69458 | 6.4% |
| 62052 | 5.7% | |
| b | 32101 | 3.0% |
| y | 29932 | 2.8% |
| Other values (22) | 219755 |
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 52261 |
| Missing (%) | 32.9% |
| Memory size | 1.2 MiB |
| Average | |
|---|---|
| Good | |
| Very Good | |
| Excellent | 1338 |
| Fair | 1320 |
| Other values (2) | 194 |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 6.08112769 |
| Min length | 4 |
Characters and Unicode
| Total characters | 648832 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Good |
|---|---|
| 2nd row | Good |
| 3rd row | Very Good |
| 4th row | Good |
| 5th row | Good |
Common Values
| Value | Count | Frequency (%) |
| Average | 58217 | |
| Good | 37497 | |
| Very Good | 8130 | 5.1% |
| Excellent | 1338 | 0.8% |
| Fair | 1320 | 0.8% |
| Poor | 175 | 0.1% |
| Default | 19 | < 0.1% |
| (Missing) | 52261 |
Length
Pie chart
| Value | Count | Frequency (%) |
| average | 58217 | |
| good | 45627 | |
| very | 8130 | 7.1% |
| excellent | 1338 | 1.2% |
| fair | 1320 | 1.1% |
| poor | 175 | 0.2% |
| default | 19 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 127259 | |
| o | 91604 | |
| r | 67842 | |
| a | 59556 | |
| A | 58217 | |
| v | 58217 | |
| g | 58217 | |
| G | 45627 | 7.0% |
| d | 45627 | 7.0% |
| V | 8130 | 1.3% |
| Other values (14) | 28536 | 4.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 525876 | |
| Uppercase Letter | 114826 | 17.7% |
| Space Separator | 8130 | 1.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 127259 | |
| o | 91604 | |
| r | 67842 | |
| a | 59556 | |
| v | 58217 | |
| g | 58217 | |
| d | 45627 | 8.7% |
| y | 8130 | 1.5% |
| l | 2695 | 0.5% |
| t | 1357 | 0.3% |
| Other values (6) | 5372 | 1.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 58217 | |
| G | 45627 | |
| V | 8130 | 7.1% |
| E | 1338 | 1.2% |
| F | 1320 | 1.1% |
| P | 175 | 0.2% |
| D | 19 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 8130 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 640702 | |
| Common | 8130 | 1.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 127259 | |
| o | 91604 | |
| r | 67842 | |
| a | 59556 | |
| A | 58217 | |
| v | 58217 | |
| g | 58217 | |
| G | 45627 | 7.1% |
| d | 45627 | 7.1% |
| V | 8130 | 1.3% |
| Other values (13) | 20406 | 3.2% |
Common
| Value | Count | Frequency (%) |
| 8130 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 648832 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 127259 | |
| o | 91604 | |
| r | 67842 | |
| a | 59556 | |
| A | 58217 | |
| v | 58217 | |
| g | 58217 | |
| G | 45627 | 7.0% |
| d | 45627 | 7.0% |
| V | 8130 | 1.3% |
| Other values (14) | 28536 | 4.4% |
| Distinct | 25 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 52261 |
| Missing (%) | 32.9% |
| Memory size | 1.2 MiB |
| Common Brick | |
|---|---|
| Brick/Siding | 5569 |
| Vinyl Siding | 5290 |
| Wood Siding | 4540 |
| Stucco | 3216 |
| Other values (20) | 7013 |
Length
| Max length | 14 |
|---|---|
| Median length | 12 |
| Mean length | 11.61341569 |
| Min length | 5 |
Characters and Unicode
| Total characters | 1239105 |
|---|---|
| Distinct characters | 35 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Common Brick |
|---|---|
| 2nd row | Common Brick |
| 3rd row | Common Brick |
| 4th row | Common Brick |
| 5th row | Common Brick |
Common Values
| Value | Count | Frequency (%) |
| Common Brick | 81068 | |
| Brick/Siding | 5569 | 3.5% |
| Vinyl Siding | 5290 | 3.3% |
| Wood Siding | 4540 | 2.9% |
| Stucco | 3216 | 2.0% |
| Shingle | 1181 | 0.7% |
| Brick Veneer | 1069 | 0.7% |
| Aluminum | 954 | 0.6% |
| Stone | 744 | 0.5% |
| Brick/Stucco | 673 | 0.4% |
| Other values (15) | 2392 | 1.5% |
| (Missing) | 52261 |
Length
| Value | Count | Frequency (%) |
| brick | 82649 | |
| common | 81068 | |
| siding | 9896 | 5.0% |
| brick/siding | 5569 | 2.8% |
| vinyl | 5290 | 2.7% |
| wood | 4540 | 2.3% |
| stucco | 3267 | 1.6% |
| veneer | 1323 | 0.7% |
| shingle | 1181 | 0.6% |
| stone | 998 | 0.5% |
| Other values (16) | 3820 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 177988 | |
| m | 164044 | |
| i | 128551 | |
| n | 107957 | |
| c | 98627 | |
| 92905 | ||
| r | 91215 | |
| B | 89622 | |
| k | 89622 | |
| C | 81204 | |
| Other values (25) | 117370 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 931708 | |
| Uppercase Letter | 207047 | 16.7% |
| Space Separator | 92905 | 7.5% |
| Other Punctuation | 7445 | 0.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 177988 | |
| m | 164044 | |
| i | 128551 | |
| n | 107957 | |
| c | 98627 | |
| r | 91215 | |
| k | 89622 | |
| d | 20599 | 2.2% |
| g | 16986 | 1.8% |
| e | 8236 | 0.9% |
| Other values (10) | 27883 | 3.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 89622 | |
| C | 81204 | |
| S | 23365 | 11.3% |
| V | 6613 | 3.2% |
| W | 4540 | 2.2% |
| A | 956 | 0.5% |
| F | 512 | 0.2% |
| H | 119 | 0.1% |
| M | 66 | < 0.1% |
| D | 32 | < 0.1% |
| Other values (3) | 18 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 92905 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 7445 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1138755 | |
| Common | 100350 | 8.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 177988 | |
| m | 164044 | |
| i | 128551 | |
| n | 107957 | |
| c | 98627 | |
| r | 91215 | |
| B | 89622 | |
| k | 89622 | |
| C | 81204 | |
| S | 23365 | 2.1% |
| Other values (23) | 86560 |
Common
| Value | Count | Frequency (%) |
| 92905 | ||
| / | 7445 | 7.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1239105 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 177988 | |
| m | 164044 | |
| i | 128551 | |
| n | 107957 | |
| c | 98627 | |
| 92905 | ||
| r | 91215 | |
| B | 89622 | |
| k | 89622 | |
| C | 81204 | |
| Other values (25) | 117370 |
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 52261 |
| Missing (%) | 32.9% |
| Memory size | 1.2 MiB |
| Built Up | |
|---|---|
| Comp Shingle | |
| Metal- Sms | |
| Slate | |
| Neopren | 1254 |
| Other values (11) | 2647 |
Length
| Max length | 14 |
|---|---|
| Median length | 10 |
| Mean length | 9.359226213 |
| Min length | 5 |
Characters and Unicode
| Total characters | 998592 |
|---|---|
| Distinct characters | 32 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Metal- Sms |
|---|---|
| 2nd row | Built Up |
| 3rd row | Built Up |
| 4th row | Built Up |
| 5th row | Neopren |
Common Values
| Value | Count | Frequency (%) |
| Built Up | 31402 | |
| Comp Shingle | 30301 | |
| Metal- Sms | 29957 | |
| Slate | 11135 | 7.0% |
| Neopren | 1254 | 0.8% |
| Shake | 907 | 0.6% |
| Clay Tile | 654 | 0.4% |
| Shingle | 433 | 0.3% |
| Metal- Pre | 244 | 0.2% |
| Typical | 229 | 0.1% |
| Other values (6) | 180 | 0.1% |
| (Missing) | 52261 |
Length
| Value | Count | Frequency (%) |
| built | 31402 | |
| up | 31402 | |
| shingle | 30734 | |
| comp | 30301 | |
| metal | 30242 | |
| sms | 29957 | |
| slate | 11135 | 5.6% |
| neopren | 1254 | 0.6% |
| shake | 907 | 0.5% |
| tile | 671 | 0.3% |
| Other values (11) | 1425 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 105067 | 10.5% |
| 92734 | 9.3% | |
| e | 76492 | 7.7% |
| t | 72911 | 7.3% |
| S | 72740 | 7.3% |
| p | 63329 | 6.3% |
| i | 63240 | 6.3% |
| m | 60360 | 6.0% |
| a | 43176 | 4.3% |
| n | 32111 | 3.2% |
| Other values (22) | 316432 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 676172 | |
| Uppercase Letter | 199437 | 20.0% |
| Space Separator | 92734 | 9.3% |
| Dash Punctuation | 30249 | 3.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 105067 | |
| e | 76492 | |
| t | 72911 | |
| p | 63329 | |
| i | 63240 | |
| m | 60360 | |
| a | 43176 | |
| n | 32111 | 4.7% |
| o | 32016 | 4.7% |
| h | 31641 | 4.7% |
| Other values (9) | 95829 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 72740 | |
| B | 31402 | |
| U | 31402 | |
| C | 31119 | |
| M | 30242 | |
| N | 1254 | 0.6% |
| T | 900 | 0.5% |
| P | 253 | 0.1% |
| R | 102 | 0.1% |
| W | 16 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 30249 |
Space Separator
| Value | Count | Frequency (%) |
| 92734 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 875609 | |
| Common | 122983 | 12.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 105067 | |
| e | 76492 | 8.7% |
| t | 72911 | 8.3% |
| S | 72740 | 8.3% |
| p | 63329 | 7.2% |
| i | 63240 | 7.2% |
| m | 60360 | 6.9% |
| a | 43176 | 4.9% |
| n | 32111 | 3.7% |
| o | 32016 | 3.7% |
| Other values (20) | 254167 |
Common
| Value | Count | Frequency (%) |
| 92734 | ||
| - | 30249 | 24.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 998592 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 105067 | 10.5% |
| 92734 | 9.3% | |
| e | 76492 | 7.7% |
| t | 72911 | 7.3% |
| S | 72740 | 7.3% |
| p | 63329 | 6.3% |
| i | 63240 | 6.3% |
| m | 60360 | 6.0% |
| a | 43176 | 4.3% |
| n | 32111 | 3.2% |
| Other values (22) | 316432 |
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 52261 |
| Missing (%) | 32.9% |
| Memory size | 1.2 MiB |
| Hardwood | |
|---|---|
| Hardwood/Carp | |
| Wood Floor | 8170 |
| Carpet | 3563 |
| Lt Concrete | 141 |
| Other values (7) | 241 |
Length
| Max length | 13 |
|---|---|
| Median length | 8 |
| Mean length | 8.604540001 |
| Min length | 6 |
Characters and Unicode
| Total characters | 918070 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Hardwood |
|---|---|
| 2nd row | Hardwood |
| 3rd row | Hardwood |
| 4th row | Hardwood |
| 5th row | Hardwood |
Common Values
| Value | Count | Frequency (%) |
| Hardwood | 83643 | |
| Hardwood/Carp | 10938 | 6.9% |
| Wood Floor | 8170 | 5.1% |
| Carpet | 3563 | 2.2% |
| Lt Concrete | 141 | 0.1% |
| Default | 110 | 0.1% |
| Ceramic Tile | 50 | < 0.1% |
| Vinyl Comp | 28 | < 0.1% |
| Parquet | 19 | < 0.1% |
| Resiliant | 15 | < 0.1% |
| Other values (2) | 19 | < 0.1% |
| (Missing) | 52261 |
Length
| Value | Count | Frequency (%) |
| hardwood | 83643 | |
| hardwood/carp | 10938 | 9.5% |
| floor | 8170 | 7.1% |
| wood | 8170 | 7.1% |
| carpet | 3563 | 3.1% |
| concrete | 141 | 0.1% |
| lt | 141 | 0.1% |
| default | 110 | 0.1% |
| tile | 50 | < 0.1% |
| ceramic | 50 | < 0.1% |
| Other values (6) | 122 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 222017 | |
| d | 197332 | |
| r | 117474 | |
| a | 109282 | |
| H | 94581 | |
| w | 94581 | |
| C | 14720 | 1.6% |
| p | 14529 | 1.6% |
| / | 10938 | 1.2% |
| 8402 | 0.9% | |
| Other values (23) | 34214 | 3.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 772694 | |
| Uppercase Letter | 126036 | 13.7% |
| Other Punctuation | 10938 | 1.2% |
| Space Separator | 8402 | 0.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 222017 | |
| d | 197332 | |
| r | 117474 | |
| a | 109282 | |
| w | 94581 | |
| p | 14529 | 1.9% |
| l | 8386 | 1.1% |
| e | 4121 | 0.5% |
| t | 4002 | 0.5% |
| n | 197 | < 0.1% |
| Other values (10) | 773 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 94581 | |
| C | 14720 | 11.7% |
| W | 8170 | 6.5% |
| F | 8170 | 6.5% |
| L | 141 | 0.1% |
| D | 110 | 0.1% |
| T | 56 | < 0.1% |
| V | 41 | < 0.1% |
| P | 19 | < 0.1% |
| R | 15 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 8402 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 10938 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 898730 | |
| Common | 19340 | 2.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 222017 | |
| d | 197332 | |
| r | 117474 | |
| a | 109282 | |
| H | 94581 | |
| w | 94581 | |
| C | 14720 | 1.6% |
| p | 14529 | 1.6% |
| l | 8386 | 0.9% |
| W | 8170 | 0.9% |
| Other values (21) | 17658 | 2.0% |
Common
| Value | Count | Frequency (%) |
| / | 10938 | |
| 8402 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 918070 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 222017 | |
| d | 197332 | |
| r | 117474 | |
| a | 109282 | |
| H | 94581 | |
| w | 94581 | |
| C | 14720 | 1.6% |
| p | 14529 | 1.6% |
| / | 10938 | 1.2% |
| 8402 | 0.9% | |
| Other values (23) | 34214 | 3.7% |
KITCHENS
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 52262 |
| Missing (%) | 32.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.219251136 |
| Minimum | 0 |
|---|---|
| Maximum | 44 |
| Zeros | 117 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 44 |
| Range | 44 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.6211695991 |
|---|---|
| Coefficient of variation (CV) | 0.5094681321 |
| Kurtosis | 220.6893857 |
| Mean | 1.219251136 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.102645696 |
| Sum | 130088 |
| Variance | 0.3858516708 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 90434 | |
| 2 | 11904 | 7.5% |
| 4 | 3051 | 1.9% |
| 3 | 1173 | 0.7% |
| 0 | 117 | 0.1% |
| 5 | 11 | < 0.1% |
| 6 | 4 | < 0.1% |
| 44 | 1 | < 0.1% |
| (Missing) | 52262 |
| Value | Count | Frequency (%) |
| 0 | 117 | 0.1% |
| 1 | 90434 | |
| 2 | 11904 | 7.5% |
| 3 | 1173 | 0.7% |
| 4 | 3051 | 1.9% |
| 5 | 11 | < 0.1% |
| 6 | 4 | < 0.1% |
| 44 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 44 | 1 | < 0.1% |
| 6 | 4 | < 0.1% |
| 5 | 11 | < 0.1% |
| 4 | 3051 | 1.9% |
| 3 | 1173 | 0.7% |
| 2 | 11904 | 7.5% |
| 1 | 90434 | |
| 0 | 117 | 0.1% |
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.374673654 |
| Minimum | 0 |
|---|---|
| Maximum | 293920 |
| Zeros | 103837 |
| Zeros (%) | 65.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 293920 |
| Range | 293920 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 737.2955949 |
|---|---|
| Coefficient of variation (CV) | 310.4829136 |
| Kurtosis | 158879.2742 |
| Mean | 2.374673654 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 398.5490354 |
| Sum | 377471 |
| Variance | 543604.7943 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 103837 | |
| 1 | 40567 | 25.5% |
| 2 | 10779 | 6.8% |
| 3 | 2410 | 1.5% |
| 4 | 841 | 0.5% |
| 5 | 277 | 0.2% |
| 6 | 148 | 0.1% |
| 7 | 47 | < 0.1% |
| 8 | 18 | < 0.1% |
| 9 | 10 | < 0.1% |
| Other values (10) | 23 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 103837 | |
| 1 | 40567 | 25.5% |
| 2 | 10779 | 6.8% |
| 3 | 2410 | 1.5% |
| 4 | 841 | 0.5% |
| 5 | 277 | 0.2% |
| 6 | 148 | 0.1% |
| 7 | 47 | < 0.1% |
| 8 | 18 | < 0.1% |
| 9 | 10 | < 0.1% |
| Value | Count | Frequency (%) |
| 293920 | 1 | < 0.1% |
| 4068 | 1 | < 0.1% |
| 1601 | 1 | < 0.1% |
| 1017 | 1 | < 0.1% |
| 922 | 1 | < 0.1% |
| 200 | 1 | < 0.1% |
| 13 | 3 | < 0.1% |
| 12 | 3 | < 0.1% |
| 11 | 3 | < 0.1% |
| 10 | 8 |
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.25299924 |
| Minimum | 11 |
|---|---|
| Maximum | 117 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.2 MiB |
Quantile statistics
| Minimum | 11 |
|---|---|
| 5-th percentile | 11 |
| Q1 | 11 |
| median | 13 |
| Q3 | 17 |
| 95-th percentile | 24 |
| Maximum | 117 |
| Range | 106 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.725735883 |
|---|---|
| Coefficient of variation (CV) | 0.261400132 |
| Kurtosis | 37.24269694 |
| Mean | 14.25299924 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 2.556818989 |
| Sum | 2265614 |
| Variance | 13.88110787 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 11 | 45597 | |
| 12 | 31623 | |
| 17 | 27511 | |
| 16 | 24741 | |
| 13 | 16588 | 10.4% |
| 24 | 8272 | 5.2% |
| 23 | 4497 | 2.8% |
| 15 | 79 | < 0.1% |
| 19 | 31 | < 0.1% |
| 117 | 8 | < 0.1% |
| Other values (6) | 10 | < 0.1% |
| Value | Count | Frequency (%) |
| 11 | 45597 | |
| 12 | 31623 | |
| 13 | 16588 | 10.4% |
| 15 | 79 | < 0.1% |
| 16 | 24741 | |
| 17 | 27511 | |
| 19 | 31 | < 0.1% |
| 23 | 4497 | 2.8% |
| 24 | 8272 | 5.2% |
| 29 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 117 | 8 | < 0.1% |
| 116 | 1 | < 0.1% |
| 83 | 2 | < 0.1% |
| 81 | 4 | < 0.1% |
| 41 | 1 | < 0.1% |
| 39 | 1 | < 0.1% |
| 29 | 1 | < 0.1% |
| 24 | 8272 | |
| 23 | 4497 | |
| 19 | 31 | < 0.1% |
| Distinct | 11359 |
|---|---|
| Distinct (%) | 7.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2473.282158 |
| Minimum | 0 |
|---|---|
| Maximum | 942632 |
| Zeros | 72 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 137 |
| Q1 | 697 |
| median | 1649 |
| Q3 | 3000 |
| 95-th percentile | 7475 |
| Maximum | 942632 |
| Range | 942632 |
| Interquartile range (IQR) | 2303 |
Descriptive statistics
| Standard deviation | 5059.046023 |
|---|---|
| Coefficient of variation (CV) | 2.04547872 |
| Kurtosis | 11264.01477 |
| Mean | 2473.282158 |
| Median Absolute Deviation (MAD) | 1092 |
| Skewness | 78.59012056 |
| Sum | 393145512 |
| Variance | 25593946.66 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1800 | 1071 | 0.7% |
| 2000 | 1020 | 0.6% |
| 4000 | 848 | 0.5% |
| 5000 | 833 | 0.5% |
| 1600 | 792 | 0.5% |
| 2500 | 601 | 0.4% |
| 1700 | 562 | 0.4% |
| 1440 | 552 | 0.3% |
| 1500 | 530 | 0.3% |
| 3000 | 511 | 0.3% |
| Other values (11349) | 151637 |
| Value | Count | Frequency (%) |
| 0 | 72 | |
| 1 | 2 | < 0.1% |
| 2 | 2 | < 0.1% |
| 3 | 2 | < 0.1% |
| 4 | 12 | < 0.1% |
| 5 | 15 | < 0.1% |
| 6 | 31 | < 0.1% |
| 7 | 63 | |
| 8 | 44 | < 0.1% |
| 9 | 112 |
| Value | Count | Frequency (%) |
| 942632 | 1 | |
| 691817 | 1 | |
| 498734 | 1 | |
| 451804 | 1 | |
| 339658 | 1 | |
| 338435 | 2 | |
| 329174 | 1 | |
| 240377 | 1 | |
| 227446 | 1 | |
| 226479 | 1 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.2 MiB |
| 2018-07-22 18:01:43 | |
|---|---|
| 2018-07-22 18:01:38 |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Characters and Unicode
| Total characters | 3020183 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 4 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2018-07-22 18:01:43 |
|---|---|
| 2nd row | 2018-07-22 18:01:43 |
| 3rd row | 2018-07-22 18:01:43 |
| 4th row | 2018-07-22 18:01:43 |
| 5th row | 2018-07-22 18:01:43 |
Common Values
| Value | Count | Frequency (%) |
| 2018-07-22 18:01:43 | 106696 | |
| 2018-07-22 18:01:38 | 52261 |
Length
Pie chart
| Value | Count | Frequency (%) |
| 2018-07-22 | 158957 | |
| 18:01:43 | 106696 | |
| 18:01:38 | 52261 | 16.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 476871 | |
| 0 | 476871 | |
| 1 | 476871 | |
| 8 | 370175 | |
| - | 317914 | |
| : | 317914 | |
| 7 | 158957 | 5.3% |
| 158957 | 5.3% | |
| 3 | 158957 | 5.3% |
| 4 | 106696 | 3.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2225398 | |
| Dash Punctuation | 317914 | 10.5% |
| Other Punctuation | 317914 | 10.5% |
| Space Separator | 158957 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 476871 | |
| 0 | 476871 | |
| 1 | 476871 | |
| 8 | 370175 | |
| 7 | 158957 | 7.1% |
| 3 | 158957 | 7.1% |
| 4 | 106696 | 4.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 317914 |
Space Separator
| Value | Count | Frequency (%) |
| 158957 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 317914 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3020183 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 476871 | |
| 0 | 476871 | |
| 1 | 476871 | |
| 8 | 370175 | |
| - | 317914 | |
| : | 317914 | |
| 7 | 158957 | 5.3% |
| 158957 | 5.3% | |
| 3 | 158957 | 5.3% |
| 4 | 106696 | 3.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3020183 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 476871 | |
| 0 | 476871 | |
| 1 | 476871 | |
| 8 | 370175 | |
| - | 317914 | |
| : | 317914 | |
| 7 | 158957 | 5.3% |
| 158957 | 5.3% | |
| 3 | 158957 | 5.3% |
| 4 | 106696 | 3.5% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.2 MiB |
| Residential | |
|---|---|
| Condominium |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 11 |
| Min length | 11 |
Characters and Unicode
| Total characters | 1748527 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Residential |
|---|---|
| 2nd row | Residential |
| 3rd row | Residential |
| 4th row | Residential |
| 5th row | Residential |
Common Values
| Value | Count | Frequency (%) |
| Residential | 106696 | |
| Condominium | 52261 |
Length
Pie chart
| Value | Count | Frequency (%) |
| residential | 106696 | |
| condominium | 52261 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 317914 | |
| e | 213392 | |
| n | 211218 | |
| d | 158957 | |
| R | 106696 | 6.1% |
| s | 106696 | 6.1% |
| t | 106696 | 6.1% |
| a | 106696 | 6.1% |
| l | 106696 | 6.1% |
| o | 104522 | 6.0% |
| Other values (3) | 209044 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1589570 | |
| Uppercase Letter | 158957 | 9.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 317914 | |
| e | 213392 | |
| n | 211218 | |
| d | 158957 | |
| s | 106696 | 6.7% |
| t | 106696 | 6.7% |
| a | 106696 | 6.7% |
| l | 106696 | 6.7% |
| o | 104522 | 6.6% |
| m | 104522 | 6.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 106696 | |
| C | 52261 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1748527 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 317914 | |
| e | 213392 | |
| n | 211218 | |
| d | 158957 | |
| R | 106696 | 6.1% |
| s | 106696 | 6.1% |
| t | 106696 | 6.1% |
| a | 106696 | 6.1% |
| l | 106696 | 6.1% |
| o | 104522 | 6.0% |
| Other values (3) | 209044 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1748527 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 317914 | |
| e | 213392 | |
| n | 211218 | |
| d | 158957 | |
| R | 106696 | 6.1% |
| s | 106696 | 6.1% |
| t | 106696 | 6.1% |
| a | 106696 | 6.1% |
| l | 106696 | 6.1% |
| o | 104522 | 6.0% |
| Other values (3) | 209044 |
| Distinct | 2913 |
|---|---|
| Distinct (%) | 5.6% |
| Missing | 106696 |
| Missing (%) | 67.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2371.544249 |
| Minimum | 1001 |
|---|---|
| Maximum | 5621 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.2 MiB |
Quantile statistics
| Minimum | 1001 |
|---|---|
| 5-th percentile | 1066 |
| Q1 | 1501 |
| median | 2265 |
| Q3 | 2910 |
| 95-th percentile | 5176 |
| Maximum | 5621 |
| Range | 4620 |
| Interquartile range (IQR) | 1409 |
Descriptive statistics
| Standard deviation | 1114.272364 |
|---|---|
| Coefficient of variation (CV) | 0.469850969 |
| Kurtosis | 1.140354537 |
| Mean | 2371.544249 |
| Median Absolute Deviation (MAD) | 709 |
| Skewness | 1.141672933 |
| Sum | 123939274 |
| Variance | 1241602.9 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1066 | 720 | 0.5% |
| 2423 | 615 | 0.4% |
| 1080 | 429 | 0.3% |
| 2282 | 423 | 0.3% |
| 2838 | 396 | 0.2% |
| 1657 | 360 | 0.2% |
| 2279 | 324 | 0.2% |
| 2661 | 302 | 0.2% |
| 2898 | 292 | 0.2% |
| 2430 | 291 | 0.2% |
| Other values (2903) | 48109 | |
| (Missing) | 106696 |
| Value | Count | Frequency (%) |
| 1001 | 36 | < 0.1% |
| 1002 | 157 | |
| 1003 | 16 | < 0.1% |
| 1004 | 21 | < 0.1% |
| 1005 | 3 | < 0.1% |
| 1006 | 4 | < 0.1% |
| 1007 | 8 | < 0.1% |
| 1008 | 36 | < 0.1% |
| 1009 | 101 | |
| 1010 | 97 |
| Value | Count | Frequency (%) |
| 5621 | 2 | < 0.1% |
| 5620 | 4 | < 0.1% |
| 5619 | 11 | < 0.1% |
| 5617 | 4 | < 0.1% |
| 5616 | 71 | |
| 5615 | 5 | < 0.1% |
| 5614 | 2 | < 0.1% |
| 5612 | 2 | < 0.1% |
| 5611 | 10 | < 0.1% |
| 5610 | 10 | < 0.1% |
LIVING_GBA
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 2216 |
|---|---|
| Distinct (%) | 4.2% |
| Missing | 106696 |
| Missing (%) | 67.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 888.834542 |
| Minimum | 0 |
|---|---|
| Maximum | 8553 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 440 |
| Q1 | 616 |
| median | 783 |
| Q3 | 1060 |
| 95-th percentile | 1662 |
| Maximum | 8553 |
| Range | 8553 |
| Interquartile range (IQR) | 444 |
Descriptive statistics
| Standard deviation | 420.1858218 |
|---|---|
| Coefficient of variation (CV) | 0.4727379528 |
| Kurtosis | 15.69514905 |
| Mean | 888.834542 |
| Median Absolute Deviation (MAD) | 206 |
| Skewness | 2.556377435 |
| Sum | 46451382 |
| Variance | 176556.1248 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 888 | 205 | 0.1% |
| 740 | 185 | 0.1% |
| 1210 | 179 | 0.1% |
| 670 | 175 | 0.1% |
| 1332 | 168 | 0.1% |
| 810 | 148 | 0.1% |
| 575 | 145 | 0.1% |
| 504 | 144 | 0.1% |
| 625 | 143 | 0.1% |
| 749 | 137 | 0.1% |
| Other values (2206) | 50632 | |
| (Missing) | 106696 |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 104 | 1 | < 0.1% |
| 148 | 1 | < 0.1% |
| 199 | 1 | < 0.1% |
| 209 | 1 | < 0.1% |
| 217 | 1 | < 0.1% |
| 231 | 1 | < 0.1% |
| 232 | 1 | < 0.1% |
| 237 | 1 | < 0.1% |
| 238 | 3 |
| Value | Count | Frequency (%) |
| 8553 | 1 | |
| 7164 | 1 | |
| 6145 | 1 | |
| 6116 | 1 | |
| 6034 | 1 | |
| 6019 | 1 | |
| 5991 | 1 | |
| 5930 | 1 | |
| 5857 | 1 | |
| 5664 | 1 |
| Distinct | 105978 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 52917 |
| Missing (%) | 33.3% |
| Memory size | 1.2 MiB |
| 1754 STANTON TERRACE SE | 5 |
|---|---|
| 1755 STANTON TERRACE SE | 5 |
| 1517 SHIPPEN LANE SE | 4 |
| 1508 SHIPPEN LANE SE | 4 |
| 1530 34TH STREET NW | 3 |
| Other values (105973) |
Length
| Max length | 41 |
|---|---|
| Median length | 20 |
| Mean length | 20.21610713 |
| Min length | 13 |
Characters and Unicode
| Total characters | 2143716 |
|---|---|
| Distinct characters | 39 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 105930 ? |
|---|---|
| Unique (%) | 99.9% |
Sample
| 1st row | 1748 SWANN STREET NW |
|---|---|
| 2nd row | 1746 SWANN STREET NW |
| 3rd row | 1744 SWANN STREET NW |
| 4th row | 1742 SWANN STREET NW |
| 5th row | 1804 NEW HAMPSHIRE AVENUE NW |
Common Values
| Value | Count | Frequency (%) |
| 1754 STANTON TERRACE SE | 5 | < 0.1% |
| 1755 STANTON TERRACE SE | 5 | < 0.1% |
| 1517 SHIPPEN LANE SE | 4 | < 0.1% |
| 1508 SHIPPEN LANE SE | 4 | < 0.1% |
| 1530 34TH STREET NW | 3 | < 0.1% |
| 1507 TOBIAS DRIVE SE | 3 | < 0.1% |
| 312 MILLERS COURT NE | 3 | < 0.1% |
| 2600 TILDEN STREET NW | 3 | < 0.1% |
| 435 1ST STREET SE | 2 | < 0.1% |
| 3121 O STREET NW | 2 | < 0.1% |
| Other values (105968) | 106006 | |
| (Missing) | 52917 |
Length
| Value | Count | Frequency (%) |
| street | 70604 | 16.3% |
| nw | 50373 | 11.7% |
| ne | 32528 | 7.5% |
| se | 21799 | 5.0% |
| place | 14390 | 3.3% |
| avenue | 10741 | 2.5% |
| road | 4670 | 1.1% |
| terrace | 1673 | 0.4% |
| 13th | 1451 | 0.3% |
| sw | 1340 | 0.3% |
| Other values (7137) | 222313 |
Most occurring characters
| Value | Count | Frequency (%) |
| 325842 | ||
| E | 285614 | |
| T | 198655 | 9.3% |
| N | 142771 | 6.7% |
| R | 124381 | 5.8% |
| S | 123510 | 5.8% |
| 1 | 88938 | 4.1% |
| A | 84258 | 3.9% |
| W | 60982 | 2.8% |
| 2 | 60729 | 2.8% |
| Other values (29) | 648036 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1375524 | |
| Decimal Number | 441577 | 20.6% |
| Space Separator | 325842 | 15.2% |
| Other Punctuation | 773 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 285614 | |
| T | 198655 | |
| N | 142771 | |
| R | 124381 | |
| S | 123510 | |
| A | 84258 | 6.1% |
| W | 60982 | 4.4% |
| L | 43615 | 3.2% |
| O | 43077 | 3.1% |
| H | 41873 | 3.0% |
| Other values (16) | 226788 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 88938 | |
| 2 | 60729 | |
| 3 | 59379 | |
| 4 | 50250 | |
| 0 | 44482 | |
| 5 | 39345 | |
| 6 | 29646 | 6.7% |
| 7 | 25592 | 5.8% |
| 8 | 22639 | 5.1% |
| 9 | 20577 | 4.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 562 | |
| ' | 211 | 27.3% |
Space Separator
| Value | Count | Frequency (%) |
| 325842 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1375524 | |
| Common | 768192 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 285614 | |
| T | 198655 | |
| N | 142771 | |
| R | 124381 | |
| S | 123510 | |
| A | 84258 | 6.1% |
| W | 60982 | 4.4% |
| L | 43615 | 3.2% |
| O | 43077 | 3.1% |
| H | 41873 | 3.0% |
| Other values (16) | 226788 |
Common
| Value | Count | Frequency (%) |
| 325842 | ||
| 1 | 88938 | 11.6% |
| 2 | 60729 | 7.9% |
| 3 | 59379 | 7.7% |
| 4 | 50250 | 6.5% |
| 0 | 44482 | 5.8% |
| 5 | 39345 | 5.1% |
| 6 | 29646 | 3.9% |
| 7 | 25592 | 3.3% |
| 8 | 22639 | 2.9% |
| Other values (3) | 21350 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2143716 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 325842 | ||
| E | 285614 | |
| T | 198655 | 9.3% |
| N | 142771 | 6.7% |
| R | 124381 | 5.8% |
| S | 123510 | 5.8% |
| 1 | 88938 | 4.1% |
| A | 84258 | 3.9% |
| W | 60982 | 2.8% |
| 2 | 60729 | 2.8% |
| Other values (29) | 648036 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 52906 |
| Missing (%) | 33.3% |
| Memory size | 1.2 MiB |
| WASHINGTON |
|---|
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 1060510 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | WASHINGTON |
|---|---|
| 2nd row | WASHINGTON |
| 3rd row | WASHINGTON |
| 4th row | WASHINGTON |
| 5th row | WASHINGTON |
Common Values
| Value | Count | Frequency (%) |
| WASHINGTON | 106051 | |
| (Missing) | 52906 |
Length
Pie chart
| Value | Count | Frequency (%) |
| washington | 106051 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 212102 | |
| W | 106051 | |
| A | 106051 | |
| S | 106051 | |
| H | 106051 | |
| I | 106051 | |
| G | 106051 | |
| T | 106051 | |
| O | 106051 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1060510 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 212102 | |
| W | 106051 | |
| A | 106051 | |
| S | 106051 | |
| H | 106051 | |
| I | 106051 | |
| G | 106051 | |
| T | 106051 | |
| O | 106051 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1060510 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 212102 | |
| W | 106051 | |
| A | 106051 | |
| S | 106051 | |
| H | 106051 | |
| I | 106051 | |
| G | 106051 | |
| T | 106051 | |
| O | 106051 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1060510 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 212102 | |
| W | 106051 | |
| A | 106051 | |
| S | 106051 | |
| H | 106051 | |
| I | 106051 | |
| G | 106051 | |
| T | 106051 | |
| O | 106051 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 52906 |
| Missing (%) | 33.3% |
| Memory size | 1.2 MiB |
| DC |
|---|
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 212102 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | DC |
|---|---|
| 2nd row | DC |
| 3rd row | DC |
| 4th row | DC |
| 5th row | DC |
Common Values
| Value | Count | Frequency (%) |
| DC | 106051 | |
| (Missing) | 52906 |
Length
Pie chart
| Value | Count | Frequency (%) |
| dc | 106051 |
Most occurring characters
| Value | Count | Frequency (%) |
| D | 106051 | |
| C | 106051 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 212102 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 106051 | |
| C | 106051 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 212102 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| D | 106051 | |
| C | 106051 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 212102 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| D | 106051 | |
| C | 106051 |
ZIPCODE
Real number (ℝ≥0)
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20012.69456 |
| Minimum | 20001 |
|---|---|
| Maximum | 20392 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.2 MiB |
Quantile statistics
| Minimum | 20001 |
|---|---|
| 5-th percentile | 20001 |
| Q1 | 20007 |
| median | 20011 |
| Q3 | 20018 |
| 95-th percentile | 20032 |
| Maximum | 20392 |
| Range | 391 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 15.62708441 |
|---|---|
| Coefficient of variation (CV) | 0.0007808585878 |
| Kurtosis | 403.503694 |
| Mean | 20012.69456 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 16.86329334 |
| Sum | 3181137877 |
| Variance | 244.2057673 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20011 | 16352 | 10.3% |
| 20002 | 16310 | 10.3% |
| 20009 | 13171 | 8.3% |
| 20019 | 12458 | 7.8% |
| 20016 | 10644 | 6.7% |
| 20001 | 10549 | 6.6% |
| 20020 | 9805 | 6.2% |
| 20007 | 9029 | 5.7% |
| 20003 | 8015 | 5.0% |
| 20008 | 6801 | 4.3% |
| Other values (14) | 45822 |
| Value | Count | Frequency (%) |
| 20001 | 10549 | |
| 20002 | 16310 | |
| 20003 | 8015 | |
| 20004 | 1082 | 0.7% |
| 20005 | 3404 | 2.1% |
| 20006 | 118 | 0.1% |
| 20007 | 9029 | |
| 20008 | 6801 | |
| 20009 | 13171 | |
| 20010 | 6428 | 4.0% |
| Value | Count | Frequency (%) |
| 20392 | 186 | 0.1% |
| 20052 | 19 | < 0.1% |
| 20037 | 3730 | 2.3% |
| 20036 | 1892 | 1.2% |
| 20032 | 5111 | |
| 20024 | 3105 | 2.0% |
| 20020 | 9805 | |
| 20019 | 12458 | |
| 20018 | 5670 | |
| 20017 | 5622 |
| Distinct | 105949 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 52906 |
| Missing (%) | 33.3% |
| Memory size | 1.2 MiB |
| 18S UJ 28168 01936 | 5 |
|---|---|
| 18S UJ 28233 01950 | 5 |
| 18S UJ 28025 01949 | 4 |
| 18S UJ 28045 01888 | 4 |
| 18S UJ 25398 04622 | 4 |
| Other values (105944) |
Length
| Max length | 18 |
|---|---|
| Median length | 18 |
| Mean length | 18 |
| Min length | 18 |
Characters and Unicode
| Total characters | 1908918 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 105863 ? |
|---|---|
| Unique (%) | 99.8% |
Sample
| 1st row | 18S UJ 23061 09289 |
|---|---|
| 2nd row | 18S UJ 23067 09289 |
| 3rd row | 18S UJ 23074 09289 |
| 4th row | 18S UJ 23078 09288 |
| 5th row | 18S UJ 23188 09253 |
Common Values
| Value | Count | Frequency (%) |
| 18S UJ 28168 01936 | 5 | < 0.1% |
| 18S UJ 28233 01950 | 5 | < 0.1% |
| 18S UJ 28025 01949 | 4 | < 0.1% |
| 18S UJ 28045 01888 | 4 | < 0.1% |
| 18S UJ 25398 04622 | 4 | < 0.1% |
| 18S UJ 26425 06527 | 3 | < 0.1% |
| 18S UJ 28027 01972 | 3 | < 0.1% |
| 18S UJ 21962 12164 | 3 | < 0.1% |
| 18S UJ 20689 08775 | 3 | < 0.1% |
| 18S UJ 29362 04313 | 2 | < 0.1% |
| Other values (105939) | 106015 | |
| (Missing) | 52906 |
Length
| Value | Count | Frequency (%) |
| 18s | 106051 | |
| uj | 104654 | |
| uh | 1397 | 0.3% |
| 13982 | 42 | < 0.1% |
| 09873 | 37 | < 0.1% |
| 24647 | 37 | < 0.1% |
| 26535 | 37 | < 0.1% |
| 24964 | 36 | < 0.1% |
| 27261 | 36 | < 0.1% |
| 07090 | 35 | < 0.1% |
| Other values (32714) | 211842 |
Most occurring characters
| Value | Count | Frequency (%) |
| 318153 | ||
| 1 | 247619 | |
| 8 | 191493 | |
| 2 | 163655 | |
| 0 | 140659 | |
| S | 106051 | 5.6% |
| U | 106051 | 5.6% |
| J | 104654 | 5.5% |
| 3 | 97866 | 5.1% |
| 7 | 89267 | 4.7% |
| Other values (5) | 343450 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1272612 | |
| Uppercase Letter | 318153 | 16.7% |
| Space Separator | 318153 | 16.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 247619 | |
| 8 | 191493 | |
| 2 | 163655 | |
| 0 | 140659 | |
| 3 | 97866 | 7.7% |
| 7 | 89267 | 7.0% |
| 4 | 87068 | 6.8% |
| 6 | 86294 | 6.8% |
| 9 | 84497 | 6.6% |
| 5 | 84194 | 6.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 106051 | |
| U | 106051 | |
| J | 104654 | |
| H | 1397 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| 318153 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1590765 | |
| Latin | 318153 | 16.7% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 318153 | ||
| 1 | 247619 | |
| 8 | 191493 | |
| 2 | 163655 | |
| 0 | 140659 | |
| 3 | 97866 | 6.2% |
| 7 | 89267 | 5.6% |
| 4 | 87068 | 5.5% |
| 6 | 86294 | 5.4% |
| 9 | 84497 | 5.3% |
Latin
| Value | Count | Frequency (%) |
| S | 106051 | |
| U | 106051 | |
| J | 104654 | |
| H | 1397 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1908918 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 318153 | ||
| 1 | 247619 | |
| 8 | 191493 | |
| 2 | 163655 | |
| 0 | 140659 | |
| S | 106051 | 5.6% |
| U | 106051 | 5.6% |
| J | 104654 | 5.5% |
| 3 | 97866 | 5.1% |
| 7 | 89267 | 4.7% |
| Other values (5) | 343450 |
| Distinct | 105522 |
|---|---|
| Distinct (%) | 66.4% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38.91485395 |
| Minimum | 38.81973129 |
|---|---|
| Maximum | 38.99553969 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.2 MiB |
Quantile statistics
| Minimum | 38.81973129 |
|---|---|
| 5-th percentile | 38.85933517 |
| Q1 | 38.89542487 |
| median | 38.91533652 |
| Q3 | 38.93607485 |
| 95-th percentile | 38.96507244 |
| Maximum | 38.99553969 |
| Range | 0.1758084 |
| Interquartile range (IQR) | 0.04064998 |
Descriptive statistics
| Standard deviation | 0.03172261554 |
|---|---|
| Coefficient of variation (CV) | 0.0008151801258 |
| Kurtosis | 0.0225011778 |
| Mean | 38.91485395 |
| Median Absolute Deviation (MAD) | 0.02030494 |
| Skewness | -0.2981973416 |
| Sum | 6185749.525 |
| Variance | 0.001006324337 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 38.93466835 | 1128 | 0.7% |
| 38.88082098 | 1022 | 0.6% |
| 38.9094433 | 592 | 0.4% |
| 38.8747892 | 524 | 0.3% |
| 38.90314058 | 504 | 0.3% |
| 38.94449932 | 429 | 0.3% |
| 38.89542487 | 428 | 0.3% |
| 38.86303776 | 410 | 0.3% |
| 38.90445577 | 406 | 0.3% |
| 38.92806083 | 367 | 0.2% |
| Other values (105512) | 153146 |
| Value | Count | Frequency (%) |
| 38.81973129 | 1 | |
| 38.81978931 | 1 | |
| 38.81988895 | 1 | |
| 38.819943 | 1 | |
| 38.81995335 | 1 | |
| 38.82001938 | 1 | |
| 38.82006029 | 1 | |
| 38.82011381 | 1 | |
| 38.82014001 | 1 | |
| 38.82020559 | 1 |
| Value | Count | Frequency (%) |
| 38.99553969 | 1 | |
| 38.9954352 | 1 | |
| 38.99530086 | 1 | |
| 38.99516273 | 1 | |
| 38.99503065 | 1 | |
| 38.99497139 | 1 | |
| 38.99489423 | 1 | |
| 38.99484815 | 1 | |
| 38.99479729 | 1 | |
| 38.99475116 | 1 |
| Distinct | 105935 |
|---|---|
| Distinct (%) | 66.6% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -77.01667632 |
| Minimum | -77.11390873 |
|---|---|
| Maximum | -76.90975796 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 158956 |
| Negative (%) | > 99.9% |
| Memory size | 1.2 MiB |
Quantile statistics
| Minimum | -77.11390873 |
|---|---|
| 5-th percentile | -77.08320993 |
| Q1 | -77.0428921 |
| median | -77.01959633 |
| Q3 | -76.98862646 |
| 95-th percentile | -76.94106208 |
| Maximum | -76.90975796 |
| Range | 0.20415077 |
| Interquartile range (IQR) | 0.0542656425 |
Descriptive statistics
| Standard deviation | 0.04093841016 |
|---|---|
| Coefficient of variation (CV) | -0.0005315525431 |
| Kurtosis | -0.3879446439 |
| Mean | -77.01667632 |
| Median Absolute Deviation (MAD) | 0.028281505 |
| Skewness | 0.1670056934 |
| Sum | -12242262.8 |
| Variance | 0.001675953426 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -77.08468826 | 1128 | 0.7% |
| -77.01427073 | 1022 | 0.6% |
| -77.03969267 | 592 | 0.4% |
| -77.01630113 | 524 | 0.3% |
| -77.01777614 | 504 | 0.3% |
| -77.06124775 | 429 | 0.3% |
| -77.02156757 | 428 | 0.3% |
| -76.94956535 | 410 | 0.3% |
| -77.03105732 | 406 | 0.3% |
| -77.0792663 | 367 | 0.2% |
| Other values (105925) | 153146 |
| Value | Count | Frequency (%) |
| -77.11390873 | 1 | |
| -77.1138097 | 1 | |
| -77.11377421 | 1 | |
| -77.1136275 | 1 | |
| -77.11356932 | 1 | |
| -77.113389 | 1 | |
| -77.11332066 | 1 | |
| -77.1132754 | 1 | |
| -77.11327046 | 1 | |
| -77.11318887 | 1 |
| Value | Count | Frequency (%) |
| -76.90975796 | 1 | |
| -76.9097583 | 1 | |
| -76.90984266 | 1 | |
| -76.90984731 | 1 | |
| -76.90988281 | 1 | |
| -76.90989558 | 1 | |
| -76.9099699 | 1 | |
| -76.90998346 | 1 | |
| -76.91001813 | 1 | |
| -76.91002365 | 1 |
| Distinct | 57 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 1.2 MiB |
| Old City 2 | |
|---|---|
| Old City 1 | |
| Columbia Heights | 9474 |
| Brookland | 6568 |
| Petworth | 6323 |
| Other values (52) |
Length
| Max length | 28 |
|---|---|
| Median length | 10 |
| Mean length | 11.51869071 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1830965 |
|---|---|
| Distinct characters | 50 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Old City 2 |
|---|---|
| 2nd row | Old City 2 |
| 3rd row | Old City 2 |
| 4th row | Old City 2 |
| 5th row | Old City 2 |
Common Values
| Value | Count | Frequency (%) |
| Old City 2 | 15978 | 10.1% |
| Old City 1 | 15000 | 9.4% |
| Columbia Heights | 9474 | 6.0% |
| Brookland | 6568 | 4.1% |
| Petworth | 6323 | 4.0% |
| Deanwood | 5983 | 3.8% |
| Chevy Chase | 5354 | 3.4% |
| Congress Heights | 4729 | 3.0% |
| Brightwood | 4112 | 2.6% |
| Mt. Pleasant | 4052 | 2.5% |
| Other values (47) | 81383 |
Length
| Value | Count | Frequency (%) |
| city | 30978 | 10.2% |
| old | 30978 | 10.2% |
| heights | 24847 | 8.2% |
| 1 | 18132 | 6.0% |
| 2 | 15978 | 5.3% |
| park | 15730 | 5.2% |
| columbia | 9474 | 3.1% |
| brookland | 6568 | 2.2% |
| petworth | 6323 | 2.1% |
| deanwood | 5983 | 2.0% |
| Other values (67) | 137659 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 149450 | 8.2% |
| 143694 | 7.8% | |
| e | 129510 | 7.1% |
| i | 127197 | 6.9% |
| l | 118398 | 6.5% |
| o | 113961 | 6.2% |
| a | 106761 | 5.8% |
| r | 104884 | 5.7% |
| d | 75993 | 4.2% |
| s | 73438 | 4.0% |
| Other values (40) | 687679 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1372729 | |
| Uppercase Letter | 263832 | 14.4% |
| Space Separator | 143694 | 7.8% |
| Decimal Number | 41026 | 2.2% |
| Dash Punctuation | 5632 | 0.3% |
| Other Punctuation | 4052 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 149450 | |
| e | 129510 | |
| i | 127197 | |
| l | 118398 | 8.6% |
| o | 113961 | 8.3% |
| a | 106761 | 7.8% |
| r | 104884 | 7.6% |
| d | 75993 | 5.5% |
| s | 73438 | 5.3% |
| n | 67782 | 4.9% |
| Other values (13) | 305355 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 73052 | |
| H | 34945 | |
| O | 32431 | |
| P | 28807 | 10.9% |
| B | 15411 | 5.8% |
| D | 9408 | 3.6% |
| F | 9406 | 3.6% |
| W | 8626 | 3.3% |
| G | 7067 | 2.7% |
| S | 6952 | 2.6% |
| Other values (10) | 37727 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 20340 | |
| 2 | 15978 | |
| 3 | 2500 | 6.1% |
| 6 | 2208 | 5.4% |
Space Separator
| Value | Count | Frequency (%) |
| 143694 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5632 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4052 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1636561 | |
| Common | 194404 | 10.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 149450 | 9.1% |
| e | 129510 | 7.9% |
| i | 127197 | 7.8% |
| l | 118398 | 7.2% |
| o | 113961 | 7.0% |
| a | 106761 | 6.5% |
| r | 104884 | 6.4% |
| d | 75993 | 4.6% |
| s | 73438 | 4.5% |
| C | 73052 | 4.5% |
| Other values (33) | 563917 |
Common
| Value | Count | Frequency (%) |
| 143694 | ||
| 1 | 20340 | 10.5% |
| 2 | 15978 | 8.2% |
| - | 5632 | 2.9% |
| . | 4052 | 2.1% |
| 3 | 2500 | 1.3% |
| 6 | 2208 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1830965 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 149450 | 8.2% |
| 143694 | 7.8% | |
| e | 129510 | 7.1% |
| i | 127197 | 6.9% |
| l | 118398 | 6.5% |
| o | 113961 | 6.2% |
| a | 106761 | 5.8% |
| r | 104884 | 5.7% |
| d | 75993 | 4.2% |
| s | 73438 | 4.0% |
| Other values (40) | 687679 |
| Distinct | 121 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 32551 |
| Missing (%) | 20.5% |
| Memory size | 1.2 MiB |
| 040 D Old City 2 | 4403 |
|---|---|
| 040 E Old City 2 | 2968 |
| 040 C Old City 2 | 2886 |
| 042 B Petworth | 2763 |
| 039 K Old City 1 | 2640 |
| Other values (116) |
Length
| Max length | 25 |
|---|---|
| Median length | 16 |
| Mean length | 17.13591127 |
| Min length | 10 |
Characters and Unicode
| Total characters | 2166082 |
|---|---|
| Distinct characters | 54 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 040 D Old City 2 |
|---|---|
| 2nd row | 040 D Old City 2 |
| 3rd row | 040 D Old City 2 |
| 4th row | 040 D Old City 2 |
| 5th row | 040 D Old City 2 |
Common Values
| Value | Count | Frequency (%) |
| 040 D Old City 2 | 4403 | 2.8% |
| 040 E Old City 2 | 2968 | 1.9% |
| 040 C Old City 2 | 2886 | 1.8% |
| 042 B Petworth | 2763 | 1.7% |
| 039 K Old City 1 | 2640 | 1.7% |
| 007 E Brookland | 2388 | 1.5% |
| 040 B Old City 2 | 2289 | 1.4% |
| 015 D Columbia Heights | 2246 | 1.4% |
| 015 A Columbia Heights | 2206 | 1.4% |
| 015 E Columbia Heights | 2183 | 1.4% |
| Other values (111) | 99434 | |
| (Missing) | 32551 | 20.5% |
Length
| Value | Count | Frequency (%) |
| b | 32080 | 6.5% |
| city | 30978 | 6.3% |
| old | 30978 | 6.3% |
| a | 29585 | 6.0% |
| c | 25558 | 5.2% |
| heights | 23663 | 4.8% |
| 040 | 15978 | 3.2% |
| 2 | 15978 | 3.2% |
| 1 | 15000 | 3.0% |
| 039 | 15000 | 3.0% |
| Other values (79) | 259231 |
Most occurring characters
| Value | Count | Frequency (%) |
| 367623 | ||
| 0 | 165735 | 7.7% |
| t | 115579 | 5.3% |
| i | 106591 | 4.9% |
| e | 94632 | 4.4% |
| o | 90288 | 4.2% |
| l | 88407 | 4.1% |
| C | 85277 | 3.9% |
| a | 73869 | 3.4% |
| d | 68048 | 3.1% |
| Other values (44) | 910033 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1045358 | |
| Decimal Number | 414612 | 19.1% |
| Space Separator | 367623 | 17.0% |
| Uppercase Letter | 334437 | 15.4% |
| Other Punctuation | 4052 | 0.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 85277 | |
| B | 45279 | |
| A | 34103 | 10.2% |
| H | 32931 | 9.8% |
| O | 30978 | 9.3% |
| D | 22102 | 6.6% |
| P | 18322 | 5.5% |
| E | 13494 | 4.0% |
| F | 7174 | 2.1% |
| M | 6780 | 2.0% |
| Other values (11) | 37997 |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 115579 | |
| i | 106591 | |
| e | 94632 | 9.1% |
| o | 90288 | 8.6% |
| l | 88407 | 8.5% |
| a | 73869 | 7.1% |
| d | 68048 | 6.5% |
| r | 61254 | 5.9% |
| s | 58370 | 5.6% |
| n | 51075 | 4.9% |
| Other values (11) | 237245 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 165735 | |
| 1 | 55999 | 13.5% |
| 2 | 46301 | 11.2% |
| 4 | 31938 | 7.7% |
| 3 | 30310 | 7.3% |
| 9 | 26573 | 6.4% |
| 5 | 22808 | 5.5% |
| 6 | 17849 | 4.3% |
| 8 | 10518 | 2.5% |
| 7 | 6581 | 1.6% |
Space Separator
| Value | Count | Frequency (%) |
| 367623 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4052 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1379795 | |
| Common | 786287 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 115579 | 8.4% |
| i | 106591 | 7.7% |
| e | 94632 | 6.9% |
| o | 90288 | 6.5% |
| l | 88407 | 6.4% |
| C | 85277 | 6.2% |
| a | 73869 | 5.4% |
| d | 68048 | 4.9% |
| r | 61254 | 4.4% |
| s | 58370 | 4.2% |
| Other values (32) | 537480 |
Common
| Value | Count | Frequency (%) |
| 367623 | ||
| 0 | 165735 | |
| 1 | 55999 | 7.1% |
| 2 | 46301 | 5.9% |
| 4 | 31938 | 4.1% |
| 3 | 30310 | 3.9% |
| 9 | 26573 | 3.4% |
| 5 | 22808 | 2.9% |
| 6 | 17849 | 2.3% |
| 8 | 10518 | 1.3% |
| Other values (2) | 10633 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2166082 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 367623 | ||
| 0 | 165735 | 7.7% |
| t | 115579 | 5.3% |
| i | 106591 | 4.9% |
| e | 94632 | 4.4% |
| o | 90288 | 4.2% |
| l | 88407 | 4.1% |
| C | 85277 | 3.9% |
| a | 73869 | 3.4% |
| d | 68048 | 3.1% |
| Other values (44) | 910033 |
| Distinct | 176 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5348.216324 |
| Minimum | 100 |
|---|---|
| Maximum | 11100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.2 MiB |
Quantile statistics
| Minimum | 100 |
|---|---|
| 5-th percentile | 502 |
| Q1 | 2102 |
| median | 5201 |
| Q3 | 8302 |
| 95-th percentile | 10200 |
| Maximum | 11100 |
| Range | 11000 |
| Interquartile range (IQR) | 6200 |
Descriptive statistics
| Standard deviation | 3369.645953 |
|---|---|
| Coefficient of variation (CV) | 0.6300504222 |
| Kurtosis | -1.425048885 |
| Mean | 5348.216324 |
| Median Absolute Deviation (MAD) | 3100 |
| Skewness | 0.007889343771 |
| Sum | 850131074 |
| Variance | 11354513.85 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5500 | 2933 | 1.8% |
| 801 | 2620 | 1.6% |
| 1001 | 2552 | 1.6% |
| 300 | 2182 | 1.4% |
| 5301 | 2179 | 1.4% |
| 100 | 2090 | 1.3% |
| 1500 | 2081 | 1.3% |
| 4400 | 1960 | 1.2% |
| 1100 | 1879 | 1.2% |
| 5201 | 1766 | 1.1% |
| Other values (166) | 136714 |
| Value | Count | Frequency (%) |
| 100 | 2090 | |
| 202 | 1684 | |
| 300 | 2182 | |
| 400 | 605 | 0.4% |
| 501 | 519 | 0.3% |
| 502 | 1023 | 0.6% |
| 600 | 1291 | |
| 701 | 1290 | |
| 702 | 696 | 0.4% |
| 801 | 2620 |
| Value | Count | Frequency (%) |
| 11100 | 1501 | |
| 11000 | 911 | |
| 10900 | 160 | 0.1% |
| 10800 | 386 | 0.2% |
| 10700 | 392 | 0.2% |
| 10600 | 1317 | |
| 10500 | 1022 | |
| 10400 | 945 | |
| 10300 | 773 | |
| 10200 | 895 |
| Distinct | 3848 |
|---|---|
| Distinct (%) | 3.6% |
| Missing | 52906 |
| Missing (%) | 33.3% |
| Memory size | 1.2 MiB |
| 009000 1001 | 340 |
|---|---|
| 009201 1004 | 312 |
| 009509 3004 | 206 |
| 009904 2009 | 204 |
| 009508 2005 | 195 |
| Other values (3843) |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 11 |
| Min length | 11 |
Characters and Unicode
| Total characters | 1166561 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 65 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 004201 2006 |
|---|---|
| 2nd row | 004201 2006 |
| 3rd row | 004201 2006 |
| 4th row | 004201 2006 |
| 5th row | 004201 2006 |
Common Values
| Value | Count | Frequency (%) |
| 009000 1001 | 340 | 0.2% |
| 009201 1004 | 312 | 0.2% |
| 009509 3004 | 206 | 0.1% |
| 009904 2009 | 204 | 0.1% |
| 009508 2005 | 195 | 0.1% |
| 009000 1010 | 189 | 0.1% |
| 007809 1001 | 175 | 0.1% |
| 000300 3003 | 170 | 0.1% |
| 009700 1012 | 160 | 0.1% |
| 000801 2008 | 158 | 0.1% |
| Other values (3838) | 103942 | |
| (Missing) | 52906 |
Length
| Value | Count | Frequency (%) |
| 1001 | 3969 | 1.9% |
| 1004 | 3712 | 1.8% |
| 1002 | 3594 | 1.7% |
| 2000 | 3378 | 1.6% |
| 1003 | 3290 | 1.6% |
| 2005 | 3139 | 1.5% |
| 2002 | 3101 | 1.5% |
| 1006 | 3030 | 1.4% |
| 1005 | 2991 | 1.4% |
| 2001 | 2960 | 1.4% |
| Other values (369) | 178938 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 565517 | |
| 1 | 135208 | 11.6% |
| 106051 | 9.1% | |
| 2 | 97511 | 8.4% |
| 3 | 52275 | 4.5% |
| 9 | 43096 | 3.7% |
| 4 | 41593 | 3.6% |
| 7 | 35999 | 3.1% |
| 8 | 32817 | 2.8% |
| 5 | 28786 | 2.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1060510 | |
| Space Separator | 106051 | 9.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 565517 | |
| 1 | 135208 | 12.7% |
| 2 | 97511 | 9.2% |
| 3 | 52275 | 4.9% |
| 9 | 43096 | 4.1% |
| 4 | 41593 | 3.9% |
| 7 | 35999 | 3.4% |
| 8 | 32817 | 3.1% |
| 5 | 28786 | 2.7% |
| 6 | 27708 | 2.6% |
Space Separator
| Value | Count | Frequency (%) |
| 106051 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1166561 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 565517 | |
| 1 | 135208 | 11.6% |
| 106051 | 9.1% | |
| 2 | 97511 | 8.4% |
| 3 | 52275 | 4.5% |
| 9 | 43096 | 3.7% |
| 4 | 41593 | 3.6% |
| 7 | 35999 | 3.1% |
| 8 | 32817 | 2.8% |
| 5 | 28786 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1166561 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 565517 | |
| 1 | 135208 | 11.6% |
| 106051 | 9.1% | |
| 2 | 97511 | 8.4% |
| 3 | 52275 | 4.5% |
| 9 | 43096 | 3.7% |
| 4 | 41593 | 3.6% |
| 7 | 35999 | 3.1% |
| 8 | 32817 | 2.8% |
| 5 | 28786 | 2.5% |
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 1.2 MiB |
| Ward 6 | |
|---|---|
| Ward 3 | |
| Ward 4 | |
| Ward 2 | |
| Ward 5 | |
| Other values (3) |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Characters and Unicode
| Total characters | 953736 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Ward 2 |
|---|---|
| 2nd row | Ward 2 |
| 3rd row | Ward 2 |
| 4th row | Ward 2 |
| 5th row | Ward 2 |
Common Values
| Value | Count | Frequency (%) |
| Ward 6 | 23973 | |
| Ward 3 | 23688 | |
| Ward 4 | 22202 | |
| Ward 2 | 22167 | |
| Ward 5 | 21359 | |
| Ward 1 | 17455 | |
| Ward 7 | 17206 | |
| Ward 8 | 10906 | |
| (Missing) | 1 | < 0.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| ward | 158956 | |
| 6 | 23973 | 7.5% |
| 3 | 23688 | 7.5% |
| 4 | 22202 | 7.0% |
| 2 | 22167 | 7.0% |
| 5 | 21359 | 6.7% |
| 1 | 17455 | 5.5% |
| 7 | 17206 | 5.4% |
| 8 | 10906 | 3.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| W | 158956 | |
| a | 158956 | |
| r | 158956 | |
| d | 158956 | |
| 158956 | ||
| 6 | 23973 | 2.5% |
| 3 | 23688 | 2.5% |
| 4 | 22202 | 2.3% |
| 2 | 22167 | 2.3% |
| 5 | 21359 | 2.2% |
| Other values (3) | 45567 | 4.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 476868 | |
| Uppercase Letter | 158956 | 16.7% |
| Space Separator | 158956 | 16.7% |
| Decimal Number | 158956 | 16.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 23973 | |
| 3 | 23688 | |
| 4 | 22202 | |
| 2 | 22167 | |
| 5 | 21359 | |
| 1 | 17455 | |
| 7 | 17206 | |
| 8 | 10906 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 158956 | |
| r | 158956 | |
| d | 158956 |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 158956 |
Space Separator
| Value | Count | Frequency (%) |
| 158956 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 635824 | |
| Common | 317912 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 158956 | ||
| 6 | 23973 | 7.5% |
| 3 | 23688 | 7.5% |
| 4 | 22202 | 7.0% |
| 2 | 22167 | 7.0% |
| 5 | 21359 | 6.7% |
| 1 | 17455 | 5.5% |
| 7 | 17206 | 5.4% |
| 8 | 10906 | 3.4% |
Latin
| Value | Count | Frequency (%) |
| W | 158956 | |
| a | 158956 | |
| r | 158956 | |
| d | 158956 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 953736 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| W | 158956 | |
| a | 158956 | |
| r | 158956 | |
| d | 158956 | |
| 158956 | ||
| 6 | 23973 | 2.5% |
| 3 | 23688 | 2.5% |
| 4 | 22202 | 2.3% |
| 2 | 22167 | 2.3% |
| 5 | 21359 | 2.2% |
| Other values (3) | 45567 | 4.8% |
| Distinct | 3291 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 237 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -77.01671188 |
| Minimum | -77.11313486 |
|---|---|
| Maximum | -76.91051093 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 158720 |
| Negative (%) | 99.9% |
| Memory size | 1.2 MiB |
Quantile statistics
| Minimum | -77.11313486 |
|---|---|
| 5-th percentile | -77.08299299 |
| Q1 | -77.04289439 |
| median | -77.01958148 |
| Q3 | -76.98884235 |
| 95-th percentile | -76.94112875 |
| Maximum | -76.91051093 |
| Range | 0.2026239377 |
| Interquartile range (IQR) | 0.05405204484 |
Descriptive statistics
| Standard deviation | 0.04093318544 |
|---|---|
| Coefficient of variation (CV) | -0.0005314844589 |
| Kurtosis | -0.3867357071 |
| Mean | -77.01671188 |
| Median Absolute Deviation (MAD) | 0.02825954114 |
| Skewness | 0.1677871257 |
| Sum | -12224092.51 |
| Variance | 0.00167552567 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -77.08469058 | 1366 | 0.9% |
| -77.01427301 | 1022 | 0.6% |
| -77.0751295 | 721 | 0.5% |
| -77.03969497 | 600 | 0.4% |
| -76.95977324 | 559 | 0.4% |
| -77.01630341 | 524 | 0.3% |
| -77.01777843 | 504 | 0.3% |
| -77.06125007 | 430 | 0.3% |
| -77.02156986 | 428 | 0.3% |
| -76.94956761 | 415 | 0.3% |
| Other values (3281) | 152151 |
| Value | Count | Frequency (%) |
| -77.11313486 | 24 | |
| -77.11177658 | 19 | < 0.1% |
| -77.11153572 | 25 | |
| -77.11045938 | 32 | |
| -77.11044559 | 17 | < 0.1% |
| -77.10924966 | 40 | |
| -77.10903087 | 19 | < 0.1% |
| -77.10835117 | 40 | |
| -77.10825751 | 52 | |
| -77.1081805 | 58 |
| Value | Count | Frequency (%) |
| -76.91051093 | 39 | |
| -76.91143455 | 9 | < 0.1% |
| -76.91163527 | 24 | |
| -76.91276833 | 33 | |
| -76.9128207 | 17 | |
| -76.91303377 | 19 | |
| -76.91342738 | 2 | < 0.1% |
| -76.91417499 | 23 | |
| -76.91426893 | 8 | < 0.1% |
| -76.91437363 | 13 | < 0.1% |
| Distinct | 3291 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 237 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38.91484631 |
| Minimum | 38.82057613 |
|---|---|
| Maximum | 38.99364643 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.2 MiB |
Quantile statistics
| Minimum | 38.82057613 |
|---|---|
| 5-th percentile | 38.85937727 |
| Q1 | 38.89543232 |
| median | 38.91522932 |
| Q3 | 38.93607698 |
| 95-th percentile | 38.9646814 |
| Maximum | 38.99364643 |
| Range | 0.1730703058 |
| Interquartile range (IQR) | 0.04064465254 |
Descriptive statistics
| Standard deviation | 0.03168182178 |
|---|---|
| Coefficient of variation (CV) | 0.0008141320032 |
| Kurtosis | 0.02283935778 |
| Mean | 38.91484631 |
| Median Absolute Deviation (MAD) | 0.02021112457 |
| Skewness | -0.3010348985 |
| Sum | 6176564.406 |
| Variance | 0.001003737831 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 38.93467581 | 1366 | 0.9% |
| 38.88082843 | 1022 | 0.6% |
| 38.92025156 | 721 | 0.5% |
| 38.90945076 | 600 | 0.4% |
| 38.92507725 | 559 | 0.4% |
| 38.87479665 | 524 | 0.3% |
| 38.90314803 | 504 | 0.3% |
| 38.94450678 | 430 | 0.3% |
| 38.89543232 | 428 | 0.3% |
| 38.92961295 | 415 | 0.3% |
| Other values (3281) | 152151 |
| Value | Count | Frequency (%) |
| 38.82057613 | 36 | < 0.1% |
| 38.82179948 | 115 | |
| 38.82189368 | 9 | < 0.1% |
| 38.82346224 | 20 | < 0.1% |
| 38.82377817 | 23 | < 0.1% |
| 38.82435976 | 61 | |
| 38.82459455 | 123 | |
| 38.82494627 | 40 | < 0.1% |
| 38.82554031 | 48 | < 0.1% |
| 38.82554343 | 15 | < 0.1% |
| Value | Count | Frequency (%) |
| 38.99364643 | 36 | |
| 38.99360337 | 30 | |
| 38.99343456 | 20 | < 0.1% |
| 38.99303037 | 14 | < 0.1% |
| 38.9917034 | 14 | < 0.1% |
| 38.99161793 | 42 | |
| 38.99116239 | 11 | < 0.1% |
| 38.99001424 | 51 | |
| 38.98977116 | 41 | |
| 38.98972573 | 45 |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 237 |
| Missing (%) | 0.1% |
| Memory size | 1.2 MiB |
| NW | |
|---|---|
| NE | |
| SE | |
| SW | 4085 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 317440 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NW |
|---|---|
| 2nd row | NW |
| 3rd row | NW |
| 4th row | NW |
| 5th row | NW |
Common Values
| Value | Count | Frequency (%) |
| NW | 89736 | |
| NE | 37675 | |
| SE | 27224 | 17.1% |
| SW | 4085 | 2.6% |
| (Missing) | 237 | 0.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| nw | 89736 | |
| ne | 37675 | |
| se | 27224 | 17.2% |
| sw | 4085 | 2.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 127411 | |
| W | 93821 | |
| E | 64899 | |
| S | 31309 | 9.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 317440 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 127411 | |
| W | 93821 | |
| E | 64899 | |
| S | 31309 | 9.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 317440 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 127411 | |
| W | 93821 | |
| E | 64899 | |
| S | 31309 | 9.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 317440 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 127411 | |
| W | 93821 | |
| E | 64899 | |
| S | 31309 | 9.9% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| Unnamed: 0 | BATHRM | HF_BATHRM | HEAT | AC | NUM_UNITS | ROOMS | BEDRM | AYB | YR_RMDL | EYB | STORIES | SALEDATE | PRICE | QUALIFIED | SALE_NUM | GBA | BLDG_NUM | STYLE | STRUCT | GRADE | CNDTN | EXTWALL | ROOF | INTWALL | KITCHENS | FIREPLACES | USECODE | LANDAREA | GIS_LAST_MOD_DTTM | SOURCE | CMPLX_NUM | LIVING_GBA | FULLADDRESS | CITY | STATE | ZIPCODE | NATIONALGRID | LATITUDE | LONGITUDE | ASSESSMENT_NBHD | ASSESSMENT_SUBNBHD | CENSUS_TRACT | CENSUS_BLOCK | WARD | SQUARE | X | Y | QUADRANT | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 4 | 0 | Warm Cool | Y | 2.0 | 8 | 4 | 1910.0 | 1988.0 | 1972 | 3.0 | 2003-11-25 00:00:00 | 1095000.0 | Q | 1 | 2522.0 | 1 | 3 Story | Row Inside | Very Good | Good | Common Brick | Metal- Sms | Hardwood | 2.0 | 5 | 24 | 1680 | 2018-07-22 18:01:43 | Residential | NaN | NaN | 1748 SWANN STREET NW | WASHINGTON | DC | 20009.0 | 18S UJ 23061 09289 | 38.914680 | -77.040832 | Old City 2 | 040 D Old City 2 | 4201.0 | 004201 2006 | Ward 2 | 152 | -77.040429 | 38.914881 | NW |
| 1 | 1 | 3 | 1 | Warm Cool | Y | 2.0 | 11 | 5 | 1898.0 | 2007.0 | 1972 | 3.0 | 2000-08-17 00:00:00 | NaN | U | 1 | 2567.0 | 1 | 3 Story | Row Inside | Very Good | Good | Common Brick | Built Up | Hardwood | 2.0 | 4 | 24 | 1680 | 2018-07-22 18:01:43 | Residential | NaN | NaN | 1746 SWANN STREET NW | WASHINGTON | DC | 20009.0 | 18S UJ 23067 09289 | 38.914683 | -77.040764 | Old City 2 | 040 D Old City 2 | 4201.0 | 004201 2006 | Ward 2 | 152 | -77.040429 | 38.914881 | NW |
| 2 | 2 | 3 | 1 | Hot Water Rad | Y | 2.0 | 9 | 5 | 1910.0 | 2009.0 | 1984 | 3.0 | 2016-06-21 00:00:00 | 2100000.0 | Q | 3 | 2522.0 | 1 | 3 Story | Row Inside | Very Good | Very Good | Common Brick | Built Up | Hardwood | 2.0 | 4 | 24 | 1680 | 2018-07-22 18:01:43 | Residential | NaN | NaN | 1744 SWANN STREET NW | WASHINGTON | DC | 20009.0 | 18S UJ 23074 09289 | 38.914684 | -77.040678 | Old City 2 | 040 D Old City 2 | 4201.0 | 004201 2006 | Ward 2 | 152 | -77.040429 | 38.914881 | NW |
| 3 | 3 | 3 | 1 | Hot Water Rad | Y | 2.0 | 8 | 5 | 1900.0 | 2003.0 | 1984 | 3.0 | 2006-07-12 00:00:00 | 1602000.0 | Q | 1 | 2484.0 | 1 | 3 Story | Row Inside | Very Good | Good | Common Brick | Built Up | Hardwood | 2.0 | 3 | 24 | 1680 | 2018-07-22 18:01:43 | Residential | NaN | NaN | 1742 SWANN STREET NW | WASHINGTON | DC | 20009.0 | 18S UJ 23078 09288 | 38.914683 | -77.040629 | Old City 2 | 040 D Old City 2 | 4201.0 | 004201 2006 | Ward 2 | 152 | -77.040429 | 38.914881 | NW |
| 4 | 4 | 2 | 1 | Warm Cool | Y | 1.0 | 11 | 3 | 1913.0 | 2012.0 | 1985 | 3.0 | NaN | NaN | U | 1 | 5255.0 | 1 | 3 Story | Semi-Detached | Very Good | Good | Common Brick | Neopren | Hardwood | 1.0 | 0 | 13 | 2032 | 2018-07-22 18:01:43 | Residential | NaN | NaN | 1804 NEW HAMPSHIRE AVENUE NW | WASHINGTON | DC | 20009.0 | 18S UJ 23188 09253 | 38.914383 | -77.039361 | Old City 2 | 040 D Old City 2 | 4201.0 | 004201 2006 | Ward 2 | 152 | -77.040429 | 38.914881 | NW |
| 5 | 5 | 3 | 2 | Hot Water Rad | Y | 1.0 | 10 | 5 | 1913.0 | NaN | 1972 | 4.0 | 2010-02-26 00:00:00 | 1950000.0 | Q | 1 | 5344.0 | 1 | 4 Story | Row Inside | Very Good | Good | Common Brick | Built Up | Hardwood | 1.0 | 4 | 11 | 2196 | 2018-07-22 18:01:43 | Residential | NaN | NaN | 1709 S STREET NW | WASHINGTON | DC | 20009.0 | 18S UJ 23157 09248 | 38.914331 | -77.039715 | Old City 2 | 040 D Old City 2 | 4201.0 | 004201 2006 | Ward 2 | 152 | -77.040429 | 38.914881 | NW |
| 6 | 6 | 1 | 0 | Warm Cool | Y | 2.0 | 5 | 2 | 1917.0 | 1988.0 | 1957 | 2.0 | 2011-05-02 00:00:00 | NaN | U | 1 | 1260.0 | 1 | 2 Story | Row Inside | Above Average | Average | Common Brick | Metal- Sms | Hardwood | 2.0 | 0 | 24 | 1261 | 2018-07-22 18:01:43 | Residential | NaN | NaN | 1769 SWANN STREET NW | WASHINGTON | DC | 20009.0 | 18S UJ 23042 09323 | 38.914983 | -77.041055 | Old City 2 | 040 D Old City 2 | 4201.0 | 004201 2005 | Ward 2 | 152 | -77.040429 | 38.914881 | NW |
| 7 | 7 | 3 | 1 | Hot Water Rad | Y | 2.0 | 8 | 4 | 1906.0 | 2011.0 | 1972 | 3.0 | 2011-09-29 00:00:00 | 1050000.0 | Q | 1 | 2401.0 | 1 | 3 Story | Row Inside | Very Good | Average | Common Brick | Metal- Sms | Hardwood | 2.0 | 1 | 24 | 1627 | 2018-07-22 18:01:43 | Residential | NaN | NaN | 1746 1/2 T STREET NW | WASHINGTON | DC | 20009.0 | 18S UJ 23124 09368 | 38.915408 | -77.040129 | Old City 2 | 040 D Old City 2 | 4201.0 | 004201 2005 | Ward 2 | 152 | -77.040429 | 38.914881 | NW |
| 8 | 8 | 3 | 1 | Warm Cool | Y | 2.0 | 7 | 3 | 1908.0 | 2008.0 | 1967 | 2.0 | 2018-05-03 00:00:00 | 1430000.0 | Q | 4 | 1488.0 | 1 | 2 Story | Row Inside | Above Average | Very Good | Common Brick | Built Up | Hardwood | 2.0 | 1 | 24 | 1424 | 2018-07-22 18:01:43 | Residential | NaN | NaN | 1727 SWANN STREET NW | WASHINGTON | DC | 20009.0 | 18S UJ 23142 09324 | 38.915017 | -77.039903 | Old City 2 | 040 D Old City 2 | 4201.0 | 004201 2005 | Ward 2 | 152 | -77.040429 | 38.914881 | NW |
| 9 | 9 | 1 | 1 | Hot Water Rad | Y | 1.0 | 6 | 2 | 1908.0 | 1979.0 | 1950 | 2.0 | 2008-12-05 00:00:00 | NaN | U | 1 | 1590.0 | 1 | 2 Story | Row Inside | Good Quality | Average | Common Brick | Built Up | Hardwood | 1.0 | 0 | 11 | 1424 | 2018-07-22 18:01:43 | Residential | NaN | NaN | 1733 SWANN STREET NW | WASHINGTON | DC | 20009.0 | 18S UJ 23127 09324 | 38.915015 | -77.040081 | Old City 2 | 040 D Old City 2 | 4201.0 | 004201 2005 | Ward 2 | 152 | -77.040429 | 38.914881 | NW |
Last rows
| Unnamed: 0 | BATHRM | HF_BATHRM | HEAT | AC | NUM_UNITS | ROOMS | BEDRM | AYB | YR_RMDL | EYB | STORIES | SALEDATE | PRICE | QUALIFIED | SALE_NUM | GBA | BLDG_NUM | STYLE | STRUCT | GRADE | CNDTN | EXTWALL | ROOF | INTWALL | KITCHENS | FIREPLACES | USECODE | LANDAREA | GIS_LAST_MOD_DTTM | SOURCE | CMPLX_NUM | LIVING_GBA | FULLADDRESS | CITY | STATE | ZIPCODE | NATIONALGRID | LATITUDE | LONGITUDE | ASSESSMENT_NBHD | ASSESSMENT_SUBNBHD | CENSUS_TRACT | CENSUS_BLOCK | WARD | SQUARE | X | Y | QUADRANT | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 158947 | 158947 | 2 | 0 | Forced Air | Y | NaN | 4 | 2 | 1938.0 | 2006.0 | 1938 | NaN | 2008-06-30 00:00:00 | 320000.0 | U | 1 | NaN | 1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 16 | 497 | 2018-07-22 18:01:38 | Condominium | 2786.0 | 809.0 | NaN | NaN | NaN | 20001.0 | NaN | 38.911840 | -77.01942 | Old City 2 | 040 B Old City 2 | 4801.0 | NaN | Ward 6 | 477 | -77.019422 | 38.911848 | NW |
| 158948 | 158948 | 2 | 0 | Forced Air | Y | NaN | 4 | 2 | 1938.0 | 2006.0 | 1938 | NaN | 2012-10-22 00:00:00 | 460000.0 | Q | 1 | NaN | 1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 16 | 573 | 2018-07-22 18:01:38 | Condominium | 2786.0 | 934.0 | NaN | NaN | NaN | 20001.0 | NaN | 38.911840 | -77.01942 | Old City 2 | 040 B Old City 2 | 4801.0 | NaN | Ward 6 | 477 | -77.019422 | 38.911848 | NW |
| 158949 | 158949 | 1 | 1 | Forced Air | Y | NaN | 4 | 1 | 1938.0 | 2006.0 | 1938 | NaN | 2015-06-09 00:00:00 | 550000.0 | Q | 6 | NaN | 1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 16 | 690 | 2018-07-22 18:01:38 | Condominium | 2786.0 | 1123.0 | NaN | NaN | NaN | 20001.0 | NaN | 38.911840 | -77.01942 | Old City 2 | 040 B Old City 2 | 4801.0 | NaN | Ward 6 | 477 | -77.019422 | 38.911848 | NW |
| 158950 | 158950 | 3 | 0 | Forced Air | Y | NaN | 5 | 3 | 1938.0 | 2006.0 | 1938 | NaN | 2015-12-24 00:00:00 | 635000.0 | U | 5 | NaN | 1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 16 | 407 | 2018-07-22 18:01:38 | Condominium | 2786.0 | 1330.0 | NaN | NaN | NaN | 20001.0 | NaN | 38.911840 | -77.01942 | Old City 2 | 040 B Old City 2 | 4801.0 | NaN | Ward 6 | 477 | -77.019422 | 38.911848 | NW |
| 158951 | 158951 | 3 | 1 | Forced Air | Y | NaN | 5 | 3 | 1938.0 | 2006.0 | 1938 | NaN | 2009-11-12 00:00:00 | 389000.0 | U | 1 | NaN | 1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 16 | 502 | 2018-07-22 18:01:38 | Condominium | 2786.0 | 1413.0 | NaN | NaN | NaN | 20001.0 | NaN | 38.911840 | -77.01942 | Old City 2 | 040 B Old City 2 | 4801.0 | NaN | Ward 6 | 477 | -77.019422 | 38.911848 | NW |
| 158952 | 158952 | 1 | 0 | Forced Air | Y | NaN | 3 | 1 | 1938.0 | 2006.0 | 1938 | NaN | 2015-04-03 00:00:00 | 399900.0 | Q | 4 | NaN | 1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 16 | 394 | 2018-07-22 18:01:38 | Condominium | 2786.0 | 639.0 | NaN | NaN | NaN | 20001.0 | NaN | 38.911840 | -77.01942 | Old City 2 | 040 B Old City 2 | 4801.0 | NaN | Ward 6 | 477 | -77.019422 | 38.911848 | NW |
| 158953 | 158953 | 1 | 0 | Forced Air | Y | NaN | 4 | 2 | 1938.0 | 2006.0 | 1938 | NaN | 2013-10-04 00:00:00 | 416000.0 | Q | 1 | NaN | 1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 16 | 506 | 2018-07-22 18:01:38 | Condominium | 2786.0 | 820.0 | NaN | NaN | NaN | 20001.0 | NaN | 38.911840 | -77.01942 | Old City 2 | 040 B Old City 2 | 4801.0 | NaN | Ward 6 | 477 | -77.019422 | 38.911848 | NW |
| 158954 | 158954 | 2 | 0 | Forced Air | Y | NaN | 4 | 2 | 1920.0 | 2007.0 | 1920 | NaN | 2008-09-30 00:00:00 | 600000.0 | U | 1 | NaN | 1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 16 | 467 | 2018-07-22 18:01:38 | Condominium | 2880.0 | 1167.0 | NaN | NaN | NaN | 20001.0 | NaN | 38.911840 | -77.01942 | Old City 2 | 040 B Old City 2 | 4801.0 | NaN | Ward 6 | 477 | -77.019422 | 38.911848 | NW |
| 158955 | 158955 | 1 | 0 | Warm Cool | Y | NaN | 2 | 0 | 1965.0 | NaN | 1965 | NaN | 2015-04-14 00:00:00 | 215100.0 | Q | 3 | NaN | 1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 17 | 332 | 2018-07-22 18:01:38 | Condominium | 2275.0 | 447.0 | NaN | NaN | NaN | 20024.0 | NaN | 38.872953 | -77.01823 | Southwest Waterfront | NaN | 11000.0 | NaN | Ward 6 | 504 | -77.018232 | 38.872961 | SW |
| 158956 | 158956 | 1 | 0 | Warm Cool | Y | NaN | 2 | 0 | 1965.0 | NaN | 1965 | NaN | 2002-07-22 00:00:00 | NaN | U | 1 | NaN | 1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0 | 17 | 332 | 2018-07-22 18:01:38 | Condominium | 2275.0 | 447.0 | NaN | NaN | NaN | 20024.0 | NaN | 38.872953 | -77.01823 | Southwest Waterfront | NaN | 11000.0 | NaN | Ward 6 | 504 | -77.018232 | 38.872961 | SW |